
Internship data science "pattern recognition"
- Ridderkerk, Zuid-Holland
- Training
- Full-time
- A data set with all events measured on the train is available. Each event record includes details such as timestamp, affected subsystem, context information, and location.
- Interventions and main train events need to be linked together, as the timestamp of an intervention is by definition different from the timestamp of the event.
- After the main event has been identified, the relationship between events and operating data can be analyzed. The analysis should point out which data point, or combination of data points, is a reliable predictor of the failure of the train or subsystem at point P during the health degradation process.
- The outcome of step 4 can be implemented in the real-time monitor of the related customer, to validate the intelligence in practice.
- Data science: this team consists of the student and the Alstom data scientist. It will be responsible for executing the data analysis and developing the intelligence.
- Fleet support center officer: this team will be responsible for validating the model by integrating the intelligence into daily operations.
- Engineering: this team will be responsible for the theoretical validation of the data model.
- Fleet / PI manager: this person is the business owner and the receiver of the final product, and will specify the requirements for the final solution.
- Proficiency in Python.
- Understanding of statistical methods and hypothesis testing using libraries like SciPy and Statsmodels.
- Familiarity with machine learning algorithms for time series data using libraries such as Scikit-learn and TensorFlow/Keras.
- Experience with data cleaning, preprocessing, and transformation using Pandas and NumPy.
- Ability to create meaningful visualizations using Matplotlib, Seaborn, and Plotly.
- Basic understanding of electrical devices and their failure modes.
- Knowledge of Git for collaborative work.
- Ability to clearly present findings and insights using Jupyter Notebooks and other relevant tools.
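
As an illustration of the linking task described above, the following is a minimal sketch of how interventions might be matched to the most recent preceding event on the same subsystem using `pandas.merge_asof`. The column names (`event_time`, `intervention_time`, `subsystem`, `event`, `action`), the sample rows, and the 48-hour matching window are assumptions made for this example; they are not taken from the actual data set.

```python
import pandas as pd

def link_interventions(events, interventions, tolerance="48h"):
    """Attach to each intervention the latest earlier event on the same subsystem.

    Column names and the tolerance window are hypothetical.
    """
    # merge_asof requires both frames to be sorted on their time keys.
    events = events.sort_values("event_time")
    interventions = interventions.sort_values("intervention_time")
    return pd.merge_asof(
        interventions,
        events,
        left_on="intervention_time",
        right_on="event_time",
        by="subsystem",            # only match within the same subsystem
        direction="backward",      # the event must precede the intervention
        tolerance=pd.Timedelta(tolerance),
    )

# Made-up sample data, for illustration only.
events = pd.DataFrame({
    "event_time": pd.to_datetime(
        ["2024-01-01 08:00", "2024-01-01 09:00", "2024-01-05 10:00"]),
    "subsystem": ["traction", "doors", "traction"],
    "event": ["overcurrent", "door blocked", "overcurrent"],
})
interventions = pd.DataFrame({
    "intervention_time": pd.to_datetime(
        ["2024-01-01 15:00", "2024-01-10 12:00"]),
    "subsystem": ["traction", "doors"],
    "action": ["replace fuse", "adjust sensor"],
})

linked = link_interventions(events, interventions)
print(linked)
```

Interventions with no event inside the window keep missing values in the event columns, which makes unmatched cases easy to review by hand. A real solution would likely need domain rules from engineering rather than a fixed time window.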