Automated type inference for Machine Learning pipelines.
In the context of tabular data, type_infer
aims for optimal interpretation of each column’s data type for ML use cases. For example, strings with date or time format would be classified as timestamps, or integers as categorical if there is a sufficiently small set of unique values in the column.
Install the package easily using pip:
pip install type_infer
Note: We recommend using a Python virtual environment.
To set up a development environment:
- Clone the repository:
git clone https://github.com/mindsdb/type_infer.git
- Navigate to the cloned directory and install in editable mode:
cd type_infer
pip install --editable .
Tp run unit tests execute:
python -m unittest discover tests
We welcome and appreciate contributions from the community! Here's how you can help:
- Report bugs
- Improve documentation
- Solve open issues
- Propose or discuss new features
- Test with your own datasets and provide feedback