tensorbay.opendataset.Newsgroups20

tensorbay.opendataset.Newsgroups20(path)

Load the 20 Newsgroups dataset from a local directory.

The folder under path is expected to have the following structure:

<path>
    20news-18828/
        alt.atheism/
            49960
            51060
            51119
            51120
            ...
        comp.graphics/
        comp.os.ms-windows.misc/
        comp.sys.ibm.pc.hardware/
        comp.sys.mac.hardware/
        comp.windows.x/
        misc.forsale/
        rec.autos/
        rec.motorcycles/
        rec.sport.baseball/
        rec.sport.hockey/
        sci.crypt/
        sci.electronics/
        sci.med/
        sci.space/
        soc.religion.christian/
        talk.politics.guns/
        talk.politics.mideast/
        talk.politics.misc/
        talk.religion.misc/
    20news-bydate-test/
    20news-bydate-train/
    20_newsgroups/

Parameters

path (str) – The root directory of the dataset.

Returns

Loaded Dataset instance.

Return type

tensorbay.dataset.dataset.Dataset
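
Example

The folder structure above can be checked before the path is handed to the loader. The following is a minimal sketch using only the standard library. The helper name missing_dirs is hypothetical and not part of tensorbay, and treating all four top-level folders as required is an assumption made for illustration; the loader itself may be more lenient.

```python
from pathlib import Path
from typing import List

# Top-level folders taken from the folder structure documented above.
TOP_LEVEL_DIRS = (
    "20news-18828",
    "20news-bydate-test",
    "20news-bydate-train",
    "20_newsgroups",
)

# Category folders listed under 20news-18828/ in the structure above.
CATEGORIES = (
    "alt.atheism",
    "comp.graphics",
    "comp.os.ms-windows.misc",
    "comp.sys.ibm.pc.hardware",
    "comp.sys.mac.hardware",
    "comp.windows.x",
    "misc.forsale",
    "rec.autos",
    "rec.motorcycles",
    "rec.sport.baseball",
    "rec.sport.hockey",
    "sci.crypt",
    "sci.electronics",
    "sci.med",
    "sci.space",
    "soc.religion.christian",
    "talk.politics.guns",
    "talk.politics.mideast",
    "talk.politics.misc",
    "talk.religion.misc",
)


def missing_dirs(path: str) -> List[str]:
    """Return the expected folders (relative to path) that do not exist.

    Hypothetical helper: it only compares the directory tree with the
    documented layout and does not read any newsgroup files.
    """
    root = Path(path)
    expected = [Path(name) for name in TOP_LEVEL_DIRS]
    expected += [Path("20news-18828") / name for name in CATEGORIES]
    return [p.as_posix() for p in expected if not (root / p).is_dir()]
```

An empty list means the tree matches the documented layout, and the same path can then be passed to Newsgroups20(path).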