My name is Sole, the leading instructor at Train in Data and the maintainer of Feature-engine, and together with a group of passionate data scientists and software developers, we maintain and expand the functionality of this Python library for feature engineering and feature selection for machine learning and its documentation, so you can better prepare your data to craft more powerful and interpretable machine learning models.
At Train in Data, we create intermediate and advanced online courses on machine learning, data science and AI software development, to help you boost your data science skills and leverage the power of this and other popular Python libraries, to create faster and robust machine learning pipelines.
We talk, blog and participate in podcasts about machine learning, software development and open-source, so you'll hear about us a lot on the digital sphere ;)
Check out the courses that we teach.
Courses | What you will learn |
---|---|
Feature engineering for machine learning | Learn to create new features, impute missing data, encode categorical variables, transform and discretize features and much more. |
Feature selection for machine learning | Learn to select features using wrapper, filter, embedded and hybrid methods, and build simpler and reliable models. |
Hyperparameter optimization for machine learning | Learn about grid and random search, Bayesian Optimization, Multi-fidelity models, Optuna, Hyperopt, Scikit-Optimize and more. |
Machine learning with imbalanced data | Learn about under- and over-sampling, ensemble and cost-sensitive methods and improve the performance of models trained on imbalanced data. |
Feature engineering for time series forecasting | Learn to create lag and window features, impute data in time series, encode categorical variabes and much more, specifically for forecasting. |
Forecasting with Machine Learning | Learn to perform time series forecasting with machine learning models like linear regression, random forests and xgboost. |
Machine Learning Interpretability | Learn to interpret the predictions of your white box and black box machine learning models. |
Find out more about machine learning through our books, and have the code at your fingertips.
Books | Summary |
---|---|
Python feature engineering Cookbook, second edition | Over 70 Python recipes to implement feature engineering in tabular, transactional, time series and text data. |
Feature selection in machine learning with Python | Over 20 methods to select the most predictive features and build simpler, faster, and more reliable machine learning models. |
The open-source libraries I contribute to.
Library | About | Sponsor us |
---|---|---|
Feature-engine | Multiple transformers for missind data imputation, categorical encoding, variable transformation and discretization, feature creation and more. | Sponsor us |
Get to know who's behind Feature-engine scene.
Instructor | Role |
---|---|
Soledad Galli | Maintainer |
Follow us on social media or through our website to be up to date with our latest news.
Media | Summary |
---|---|
Train in Data | Enroll in our courses and books |
I talk about data science, machine learning and how to become a data scientist. | |
I tweet about data science, machine learning and how to become a data scientist. | |
I talk about data science, machine learning and how to become a data scientist. | |
I post about data science, machine learning and how to become a data scientist. | |
Newsletter | I talk about data science, machine learning and how to become a data scientist. |
Blog | I write about data science, machine learning, feature engineering and selection and more. |
Help me gather a team of regular software developers and data scientists to fast track the development of Feature-engine's functionality and documentation. Joing our sponsors through Github sponsors or Buy me a coffee and help us democratize data science and machine learning tools and knowledge!
We hope to see you around.