Data
Make Dataset
- class src.data.make_dataset.MakeDataset(file_name)
Bases: object
Class creating pandas DataFrame from raw file_name.
Creates the MakeDataset.data attribute, a pandas DataFrame built from the imported project/data/raw/file_name file. Files from other directories, or files that are not tab-delimited (delimiter = "\t"), will raise errors.
- Parameters:
- file_name : str
Raw file name to be imported. Has to be in data/raw/ folder.
Notes
Imported raw file should be a .csv file.
Required libraries:
import pandas as pd
import os
- Attributes:
- data : pandas DataFrame
Imported file.
- delimiter : str
Delimiter in imported raw file. Used as pandas read_csv parameter.
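As a minimal sketch of how a tab delimiter is passed to pandas read_csv, the example below parses a small in-memory string instead of a real file. The sample content and the variable names raw_text and delimiter are assumptions made for illustration; they are not taken from the MakeDataset source.

```python
import io

import pandas as pd

# Hypothetical tab-delimited content standing in for a file in data/raw/.
raw_text = "id\tname\tvalue\n1\talpha\t0.5\n2\tbeta\t1.5\n"

# Assumed value of the delimiter attribute (a tab character).
delimiter = "\t"

# read_csv accepts `delimiter` as an alias for `sep`.
data = pd.read_csv(io.StringIO(raw_text), delimiter=delimiter)

print(data.shape)
print(list(data.columns))
```

With the tab delimiter, each field lands in its own column of the resulting DataFrame.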
Methods
__init__(self, file_name)
Constructor method.
_import_dataset(self)
Imports file_name and returns it as the self.data pandas DataFrame.
- _import_dataset()
Imports the raw file from the project/data/raw/ folder.
Resolves the raw data location relative to the script location.
- Returns:
- data : pandas DataFrame
Imported file.
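Resolving the raw data location relative to the script could look roughly like the sketch below. The helper name raw_file_path is hypothetical, and the directory layout (script at project/src/data/, raw files at project/data/raw/) is an assumption inferred from the module path src.data.make_dataset; the actual implementation may differ.

```python
import os


def raw_file_path(file_name, script_path):
    """Return the path to data/raw/<file_name>, resolved from script_path.

    Hypothetical helper: assumes the script lives in project/src/data/.
    """
    # Directory that contains the script, as an absolute path.
    script_dir = os.path.dirname(os.path.abspath(script_path))
    # Assumed layout: go up two levels (data/ -> src/ -> project/).
    project_root = os.path.dirname(os.path.dirname(script_dir))
    return os.path.join(project_root, "data", "raw", file_name)


# Example call with a hypothetical script location.
example = raw_file_path(
    "example.csv", os.path.join("project", "src", "data", "make_dataset.py")
)
print(example)
```

Inside a module, __file__ would typically be passed as script_path, so the result does not depend on the current working directory.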