Education & Research Full-time

Data Engineer

Elicit

Location

Remote, USA

Type

Full-time

Posted

Jan 11, 2022

Mission

What you will drive

  • Build and optimize an academic research paper pipeline
  • Architect and implement robust, scalable solutions to handle growing data needs while maintaining high performance and data quality
  • Work on efficiently processing, deduplicating, and indexing hundreds of millions of research papers
  • Optimize Spark jobs and data pipelines to handle large amounts of data efficiently
  • Develop processes to ensure regular updates from multiple academic paper sources are handled with efficient deduplication

Impact

The difference you'll make

This role helps organize and make accessible hundreds of millions of academic research papers, potentially accelerating scientific discovery and knowledge dissemination.

Profile

What makes you a great fit

  • Experience building and optimizing data pipelines
  • Proficiency with Spark and handling large-scale data processing
  • Ability to architect scalable solutions for growing data needs
  • Skills in data deduplication and indexing

Benefits

What's in it for you

No benefits information provided in the job description.

About

Inside Elicit

Visit site →

Elicit appears to be an organization focused on organizing and making academic research papers more accessible through data engineering and technology solutions.