Education & Research
Data Engineer
Elicit
Location
Remote, USA
Type
Full-time
Posted
Jan 11, 2022
Mission
What you will drive
- Build and optimize an academic research paper pipeline
- Architect and implement robust, scalable solutions to handle growing data needs while maintaining high performance and data quality
- Efficiently process, deduplicate, and index hundreds of millions of research papers
- Optimize Spark jobs and data pipelines to handle large amounts of data efficiently
- Develop processes that ingest regular updates from multiple academic paper sources and deduplicate them efficiently
Impact
The difference you'll make
This role helps organize and make accessible hundreds of millions of academic research papers, potentially accelerating scientific discovery and knowledge dissemination.
Profile
What makes you a great fit
- Experience building and optimizing data pipelines
- Proficiency with Spark and handling large-scale data processing
- Ability to architect scalable solutions for growing data needs
- Skills in data deduplication and indexing
Benefits
What's in it for you
No benefits information provided in the job description.
About
Inside Elicit
Elicit appears to be an organization focused on organizing and making academic research papers more accessible through data engineering and technology solutions.