datasets.domains¶
This submodule is responsible for making data about the classification of email domains available in Python memory.
The data is stored in a CSV file that is provided with the BigBang repository.
This file was generated using a script that is provided for the library for reproducibility.
The script can be found in Create Domain-Category Data.ipynb
-
bigbang.datasets.domains.domains.
load_data
()¶ Returns a datafarme with email domains labeled by category.
Categories include: generic, personal, company, academic, sdo
- Returns
data
- Return type
pandas.DataFrame