
Internship data science "Generative AI"
- Ridderkerk, Zuid-Holland
- Training
- Voltijds
- Identify data sources, create a data model, and a data flow together with the end-users
- Develop a minimum viable conversational web interface to enable end-users
- Fine-tune the LLM model (via in-context learning, prompt engineering, partial retraining)
- Validate the results in practice via quantitative and qualitative methods
- Data science: this will be the student, and the Alstom data scientist. This team will be responsible for the execution of the data analysis and the development of the intelligence
- Fleet support center officer: this team will be responsible for the validation of the model by the integration of the intelligence in the daily operations
- Engineering: this team will be responsible for the theoretical validation of the data model
- Fleet / PI manager: this person is the business owner and the receiver of the final product. This person will indicate requirements of the final solution
- Proficiency in Python.
- Understanding of statistical methods and hypothesis testing using libraries like SciPy and Statsmodels.
- Familiarity with machine learning algorithms and libraries such as Scikit-learn and TensorFlow.
- Familiarity with language processing and semantic search.
- Experience with data cleaning, preprocessing, and transformation using Pandas and NumPy.
- Ability to create meaningful visualizations using Matplotlib, Seaborn, and Plotly.
- Knowledge of Git for collaborative work.
- Ability to clearly present findings and insights using Jupyter Notebooks and other relevant tools.