Dora is a library designed to simplify the exploratory data analysis which is such a painful part. It automates the repetitive tasks that consume most of the time.
The library has functions that are very convenient for data cleaning, visualization, feature extraction and selection, visualization. Apart from this, it is also used for model validation by partitioning data, and transformations of data.
This library uses scikit-learn, pandas, and matplotlib. The intention of this library is to add additional features to general library mentioned before for exploratory data analysis.
pip install Dora
In-order to implement it in datasets use the below syntax:
It can be used for :
- Reading Data & Configuration
- Feature Selection & Extraction
- Model Validation
- Data Versioning
Below is the most basic implementation of Dora module on a dataset in Python:
Share your thoughts in the comments
Please Login to comment...