Data Engineering
- https://www.startdataengineering.com/
- https://medium.com/expedia-group-tech/software-architectural-patterns-in-data-engineering-5d3bf22106a0
- https://medium.com/@shailav.shrestha/reference-types-of-sql-joins-4511cc802f02
- https://medium.com/@patrickelhageuniv/what-are-abstract-data-flows-and-why-should-you-use-them-long-form-381973a3c724
- https://towardsdatascience.com/data-mesh-topologies-and-domain-granularity-65290a4ebb90
- https://medium.com/@chitrarth236/window-functions-in-sql-fundamentals-6e861b486e8f
- https://www.analyticsvidhya.com/blog/2022/06/hive-advance-performance-tuning-techniques/
- https://medium.com/@alexander.marquardt/data-integration-guide-techniques-technologies-and-tools-airbyte-62041ec2adb6
- https://medium.com/whispering-data/the-state-of-data-engineering-2022-d6ef0f7cf607
- https://medium.com/kyligence/7-must-know-data-buzzwords-in-2022-9d3d977a43f4
- https://dataproducts.substack.com/p/an-engineers-guide-to-data-contracts
- https://medium.com/@masterkeshav/decentralized-domain-data-engineering-part-i-c1ada63c2023
- https://medium.com/@masterkeshav/decentralized-domain-data-engineering-part-ii-842a8589250c
Machine Learning
- https://stumpfkevin.medium.com/what-is-operational-machine-learning-35dd735a1e44
- https://towardsdatascience.com/feature-engineering-for-machine-learning-3a5e293a5114
- https://www.kaggle.com/code/prashant111/a-reference-guide-to-feature-engineering-methods/notebook
- https://neptune.ai/blog/the-ultimate-guide-to-evaluation-and-selection-of-models-in-machine-learning
- https://towardsdatascience.com/the-ultimate-guide-to-adaboost-random-forests-and-xgboost-7f9327061c4f
- https://towardsdatascience.com/which-machine-learning-model-to-use-db5fdf37f3dd
- https://habr.com/ru/company/selectel/blog/702416/
ML Ops
Architecture
Patterns
- https://rspacesamuel.medium.com/design-patterns-every-data-engineer-should-know-f6c48cd73592
- https://www.eckerson.com/articles/data-pipeline-design-patterns
- https://www.dataplatformschool.com/blog/data-engineering-patterns/
Data Science
- https://towardsdatascience.com/6-dimensionality-reduction-techniques-how-and-when-to-use-them-e4891c10b5db
- https://habr.com/ru/company/timeweb/blog/666024/ (DS + Sport)
- https://towardsdatascience.com/approaches-for-addressing-unfairness-in-machine-learning-a31f9807cf31
Data Architecture
- https://towardsdatascience.com/data-mesh-topologies-and-domain-granularity-65290a4ebb90
- https://medium.com/quantumblack/lakes-warehouses-lakehouses-a-short-history-of-data-architecture-bc942b0ed463
- https://blog.devgenius.io/what-is-a-data-mesh-and-what-is-it-used-for-a59f5c8f1fa2 (4 principles of a Data Mesh)
- https://medium.com/@david.c.dupuis/data-mesh-explained-a95b6ae50878
- https://kenzanmedia.medium.com/driving-growth-with-data-mesh-architectures-e10a009dc7e
Misc
- https://medium.com/@matteopelati/are-rust-c-and-wasm-the-new-tools-for-data-engineering-502f007af1d (Rust for data)
- https://getdozer.io/
- https://towardsdatascience.com/the-skeleton-of-a-data-science-project-1559138480d0
- https://blog.devgenius.io/data-engineer-learning-path-6286e537a9c2
- https://teepika-r-m.medium.com/materialized-views-in-hive-6ea621944446
- https://blog.devgenius.io/globally-optimized-data-pipelines-on-the-cloud-airflow-spark-d894bf06d9d1
- https://dbdiagram.io/
- http://agiledata.org/