Data

Make Dataset

class src.data.make_dataset.MakeDataset(file_name)

Bases: object

Class creating pandas DataFrame from raw file_name.

Creates MakeDataset.data attribute, which represents pandas DataFrame from imported project/data/raw/file_name file. Files from other directories or files that are not delimited by tabulation (delimiter = “t”) will raise errors.

Parameters:
file_namestr

Raw file name to be imported. Has to be in data/raw/ folder.

Notes

Imported raw file should be a .csv file.

Required libraries:

  • import pandas as pd

  • import os

Attributes:
datapandas DataFrame

imported file

delimiter :str

Delimiter in imported raw file. Used as pandas read_csv parameter.

Methods

__init__(self, file_name)

Constructor method.

_import_dataset(self)

Imports file_name and returns as self.data pandas DataFrame.

_import_dataset()

Importing raw file from project/data/raw/ folder.

Establishing raw data relative location based script location.

Returns:
datapandas DataFrame

Imported file.