From Aviation Hardware to Data Science: Why I’m Trading Excel for R and SQL
Summary
The author transitioned from a Reclamation Engineer at an aviation plant, Krasny Oktyabr, to pursuing a Master's in Data Science at CUNY SPS. While managing approximately 1,000 failure reports annually, the author developed manual Excel systems to track part failures and improve efficiency in a predominantly manual data collection culture. This experience highlighted the critical need for automated reporting, instant visualization, and data integrity, which are core functionalities of tools like R, Python, and SQL. The author's background in hands-on quality control, directly inspecting and verifying physical parts, provides a unique perspective on data quality, emphasizing that data labels represent tangible components and real-world issues, aligning with Data-Centric AI principles.
Key takeaway
For Data Scientists or Engineers transitioning into data roles, your direct experience with physical processes or "the metal" is a significant asset. This background enables you to intuitively grasp data quality issues and their real-world implications, a perspective crucial for building robust, effective AI models. Embrace this unique understanding to drive better data-centric solutions.
Key insights
Hands-on experience with physical systems provides a critical advantage in understanding data quality for Data Science.
Principles
- Data quality limits model effectiveness.
- Automate reporting for efficiency and integrity.
In practice
- Transition from manual Excel to R, Python, SQL.
- Apply shop floor discipline to data engineering.
Topics
- Data Science Career Path
- Data Quality
- Data-Centric AI
- Manufacturing Analytics
- Data Management Challenges
Best for: Data Scientist, AI Student, General Interest
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Data Science on Medium.