What is Kedro?

Kedro is an open-source Python framework for creating reproducible, maintainable and modular data science code. It borrows concepts from software engineering and applies them to machine-learning code; applied concepts include modularity, separation of concerns and versioning. Kedro is hosted by the LF AI & Data Foundation.

For the source code, take a look at the Kedro repository on Github.

Learn how to use Kedro

In the next few chapters, you will learn how to install Kedro and set up your own production-ready data pipelines.

Once you are set up, we suggest working through our examples, including:

We also recommend the frequently asked questions and the API reference documentation for additional information.

Assumptions

We have designed the documentation and the spaceflights tutorial for anyone new to Kedro. The more knowledge of Python you have, the easier you will find the learning curve.

Note

There are a number of excellent online resources for learning Python, but you should choose those that reference Python 3, as Kedro is built for Python 3.6+. There are many curated lists of online resources, such as the official Python programming language website and this list of free programming books and tutorials.