DESCRIPTION :
Entalpic is committed to equal opportunity employment and a diverse, inclusive workplace. We encourage applications from all backgrounds-even if you don't meet every requirement. If you're passionate about our mission and think you can contribute, we want to hear from you.
Reporting & Job Location
You will report to the CTO of Entalpic and be based in our Paris office.
Mission Highlights
Data Infrastructure Development
Design, build, and maintain scalable data infrastructure to integrate diverse data sources (text, simulations, experiments) in support of ML and LLM applications.
Data Platform Enhancement
Lead the development of internal tools to enable efficient, AI-enhanced access to data and promote a data-centric culture across the organization.
Role & Responsibilities
Data Engineering: Build and optimize scalable data pipelines for simulation (e.g. DFT), textual (e.g. patents, papers), and experimental data (e.g. time series, imagery).
Data Storage Solutions: Implement and manage secure, scalable data storage systems supporting analytics and ML workflows.
Automation and Scripting: Create tools and scripts to automate data ingestion, transformation, and processing.
Data Governance and Lineage: Establish policies for data quality, lineage tracking, and regulatory compliance.
Infrastructure Support: Work closely with DevOps to integrate solutions with system architecture (AWS/GCP).
Collaboration and Support: Partner with scientists and experts to meet data needs and enable data-driven decisions.
Open Source Engagement: Contribute tools and learnings to open-source projects to support the broader community.
Code d'emploi : Architecte de Données (h/f)
Domaine professionnel actuel : Spécialistes Bases de Données
Niveau de formation : Bac+5
Temps partiel / Temps plein : Plein temps
Type de contrat : Contrat à durée indéterminée (CDI)
Compétences : Intelligence Artificielle, Amazon Web Services, Analyse des Données, Cloud Computing, Programmation Informatique, Intégration Continue, Ingénierie de l'Information, Gouvernance des Données, Infrastructure de Données, ETL, Entreposage de Données, DevOps, Données Expérimentales, Python (Langage de Programmation), PostgreSQL, MongoDB, MySQL, NoSQL, Technologie Open Source, Structured Query Language (SQL), Architecture des Systèmes, Traitement des Données, Scripting, Technologies de Stockage de Données, Ingestion de Données, Large Language Models, Git, Conteneurisation, Kubernetes, Technologies Informatiques, Gestion des Données, Terraform, Software Version Control, Pipeline de Données, Docker, Programming Languages, Anglais, Sens de la Communication, Esprit d'Équipe, Implication et Investissement, Systèmes Automatisés, Matières Premières, Conformité Réglementaire, Modélisation des Données, Qualité des Données, Expérimentation, Gestion des Infrastructures, Simulations, Séries Chronologiques, Workflows
Téléphone :
0475433031
Type d'annonceur : Employeur direct