Engineering Practices for Data Scientists
A handbook of everything data scientists need to know about engineering.
What's in the Book
Data scientists are entering the development world because machine learning is becoming a core part of many products. It’s better to be prepared for the slog of building and maintaining software.
This eBook will help you pick up engineering best practices with simple tips.
Git
We give an overview of what Git is, explain the terminology and basic commands, and provide rules of thumb to make work with Git smooth and painless.
Docker
The fundamental idea is to package an application and its dependencies into a single reusable artifact, which can be instantiated reliably in different environments.
Command line
The command line generally follows the philosophy of "Make each program do one thing well". The fundamental premise is that you can do complex things by combining these simple programs.