datasets.domains

This submodule is responsible for making data about the classification of email domains available in Python memory.

The data is stored in a CSV file that is provided with the BigBang repository.

This file was generated using a script that is provided for the library for reproducibility. The script can be found in Create Domain-Category Data.ipynb

bigbang.datasets.domains.domains.load_data()

Returns a datafarme with email domains labeled by category.

Categories include: generic, personal, company, academic, sdo

Returns

data

Return type

pandas.DataFrame