Internship spring 2022 – building a Datalab

Job description

In recent years, we have gained the ability to use computers to process large amounts of data. The ecosystem has grown to span a wide range of tools and libraries and has given rise to the field of data science. Connecting all these components into a coherent and secure platform is a daunting task. Newcomers, as well as more experienced users, benefit from platforms that offer a premium developer experience.

A Datalab provides developers with a comprehensive software suite that helps them explore, visualize, process and expose data. Using their favorite languages, such as Python, JavaScript or SQL, they build pipelines to collect and store data, create visualization dashboards and implement machine learning models.

As part of your internship, you will assemble several open source technologies to provide data scientists with a modern environment that fits their needs. Data scientists expect a user-friendly web interface that provides their favorite development editors, the ability to use their favorite libraries without restrictions in an isolated and self-contained environment, resources that scale according to their requirements, and the ability to push their code into production.

The Datalab platform relies on a flexible Kubernetes backend combined with object storage compatible with the standard S3 interfaces. Containers are provisioned on demand and cover a wide range of databases (Elasticsearch, MongoDB, PostgreSQL…), environments (TensorFlow, VSCode, Jupyter, RStudio…), and complementary tools such as secret management with Vault, automated provisioning with Argo CD, OpenID Connect authentication with Keycloak, workflow scheduling, API publishing, …

During this internship, you will become familiar with Kubernetes and the CNCF ecosystem, gain a deep understanding of the roles and responsibilities expected of data scientists and become comfortable serving their needs. You will be part of an agile team led by a Data Science expert.

In addition, at the end of the internship, you will receive a certification from a cloud provider and a Databricks certification.

Company presentation

Adaltas is a consulting agency led by a team of open source experts with a focus on data management. We deploy and operate the storage and computing infrastructure in collaboration with our customers.

We work with Cloudera and Databricks and are also open source contributors. We invite you to browse our website and our many technical publications to learn more about the company.

Responsibilities

  • Understand and address the needs of data scientists
  • Learn the different moving parts of a Datalab
  • Deploy Datalab in a Kubernetes cluster
  • Implement machine learning workflows

Expected Qualifications

  • Engineering school, final-year internship
  • Analytical and structured
  • Autonomous and curious
  • You are an open-minded person who enjoys sharing, communicating and learning from others
  • Good knowledge of Python, Spark and Linux systems

You will be responsible for understanding the architecture and integrating it with an existing infrastructure. You will work with InfraOps and data scientists. We are looking for a person who will develop competence in the following tools and solutions:

Any additional experience is valuable.

More information

  • Location: Boulogne Billancourt, France
  • Language: French or English
  • Start: February 2022
  • Duration: 6 months
  • Remote work: possible 2 days a week

Available hardware

A laptop with the following features:

  • 32 GB of RAM
  • 1 TB SSD
  • 8c/16t CPU

A cluster consisting of:

  • 3x 28c/56t Intel Xeon Scalable Gold 6132
  • 3x 192GB RAM DDR4 ECC 2666MHz
  • 3x 14 SSD 480GB SATA Intel S4500 6Gbps

A Kubernetes cluster.

Compensation

  • Salary: €1200/month
  • Meal vouchers
  • Transport pass
  • Participation in an international conference

Conferences we have attended in the past include KubeCon (organized by the CNCF), the Open Source Summit (from the Linux Foundation) and FOSDEM.

For any request for further information and to submit your application, please contact David Worms:
