Skip to content
@trainindata

Train In Data

Advanced content creation for data science and machine learning

Welcome to Train in Data

We are a group of passionate data scientists and software developers with the mission to make intermediate and advanced topics on machine learning, data science and AI software engineering accessible to the wider data science community.

We create intermediate and advanced online courses on machine learning, data science and AI software development. We also maintain an open-source library for feature engineering: Feature-engine.

In addition, we talk, blog and participate in podcasts about machine learning, software development and open-source.

Online Courses

Check out the courses that we teach.

Courses What you will learn
Feature engineering for machine learning Learn to create new features, impute missing data, encode categorical variables, transform and discretize features and much more.
Feature selection for machine learning Learn to select features using wrapper, filter, embedded and hybrid methods, and build simpler and reliable models.
Hyperparameter optimization for machine learning Learn about grid and random search, Bayesian Optimization, Multi-fidelity models, Optuna, Hyperopt, Scikit-Optimize and more.
Machine learning with imbalanced data Learn about under- and over-sampling, ensemble and cost-sensitive methods and improve the performance of models trained on imbalanced data.
Feature engineering for time series forecasting Learn to create lag and window features, impute data in time series, encode categorical variabes and much more, specifically for forecasting.
Forecasting with Machine Learning Learn to perform time series forecasting with machine learning models like linear regression, random forests and xgboost.
Machine Learning Interpretability Learn to interpret the predictions of your white box and black box machine learning models.

Books

Find out more about machine learning through our books, and have the code at your fingertips.

Books Summary
Python feature engineering Cookbook, third edition Over 70 Python recipes to implement feature engineering in tabular, transactional, time series and text data.
Feature selection in machine learning with Python Over 20 methods to select the most predictive features and build simpler, faster, and more reliable machine learning models.

Open-source

The open-source libraries I contribute to.

Library About Sponsor us
Feature-engine Multiple transformers for missind data imputation, categorical encoding, variable transformation and discretization, feature creation and more. Sponsor us

Our instructors

Get to know the creators and instructors of our courses.

Instructor Role
Soledad Galli Data scientist
Kishan Manani Data scientist
Chris Samiullah Software developer

Follow us

Follow us on social media or through our website to be up to date with our latest news.

Media Summary
Train in Data Enroll in our courses and books
Newsletter I talk about data science, machine learning and how to become a data scientist.
YouTube I post about data science, machine learning and how to become a data scientist.
LinkedIn I talk about data science, machine learning and how to become a data scientist.
Twitter I tweet about data science, machine learning and how to become a data scientist.
Facebook I talk about data science, machine learning and how to become a data scientist.
Blog I write about data science, machine learning, feature engineering and selection and more.

Profile views counter


We hope to see you around.

Pinned Loading

  1. feature-engineering-for-time-series-forecasting feature-engineering-for-time-series-forecasting Public

    Code repository for the online course "Feature Engineering for Time Series Forecasting".

    Jupyter Notebook 178 129

  2. machine-learning-interpretability machine-learning-interpretability Public

    Forked from solegalli/machine-learning-interpretability

    Jupyter Notebook 5 6

  3. forecasting-with-machine-learning forecasting-with-machine-learning Public

    Code repository for the course "Forecasting with Machine Learning Models"

    Jupyter Notebook 15 8

Repositories

Showing 10 of 10 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…