Data Scientist and Application Developer
Job Category:
Fulltime Regular
Exempt Overtime Eligible: Exempt
Benefits Eligible: Benefit Based
Caltech is a world-renowned science and engineering institute that marshals some of the world's brightest minds and most innovative tools to address fundamental scientific questions. We thrive on finding and cultivating talented people who are passionate about what they do. Join us and be a part of the diverse Caltech community.
Job Summary
The Einstein Papers Project at Caltech has an opening for a Data Scientist and Application Developer. Come be a part of the team that is researching and publishing Albert Einstein's published and unpublished writings, his letters and those of his colleagues, family, and friends, his calculation drafts, notebooks, and interviews.
This role will work with a vibrant team of scientists and historians and philosophers of science to carry out text and image processing, extracting relevant information from printed sources, photographs, and handwritten manuscripts. The major source of data consists of some 100,000 archival records and tens of thousands of printed sources. We are interested in archive services and analysis tools related to the discovery of specific search objects (text, equations, concepts) and their characterization.
Essential Job Duties
- As an Einstein Papers analyst and developer, you will work with the team members to develop systems to ingest, process and deliver content and tools to users, using modern methodologies and AI software, particularly those tools developed to support digital humanities research, methods and platforms.
- Developing and improving infrastructure to ingest and organize database records and their associated files (jpg, tiff, pdf)
- Automating and improving the management of our repositories (move data to GitHub or other cloud-based repository)
- Streamline deployment of software into test and production environments, through continuous integration, and troubleshoot runtime issues.
Basic Qualifications
- Bachelor's degree in Computer Science/Engineering, Information Sciences, or related field.
- Three or more years of relevant experience developing code using python or similar programming languages
- System administration of local networks
- Web server and website administration and operation
- Dynamic database design
- Good familiarity with the TeX-typsetting system
- Strong communication and interpersonal skills.
Preferred Qualifications
- Knowledge of Adobe products (Photoshop, InDesign) and Adobe FrameMaker (or willingness to learn).
- Production experience with AWS.
- Experience with configuration management, Git/GitHub.
- Strong Linux/UNIX skills.
- Experience with SQL.
- Ability to read German is a great plus.
Required Documents
- Resume.
To be considered for this position please visit our web site and apply on line at the following link: https://apptrkr.com/6875918
We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.
Unlock this job opportunity
View more options below
View full job details
See the complete job description, requirements, and application process
Stay on their radar
Join the talent pool for CalTech - California Institute of Technology
Join Talent PoolExpress interest in this position
Let CalTech - California Institute of Technology know you're interested in Data Scientist and Application Developer
Get similar job alerts
Receive notifications when similar positions become available




%20Jobs.jpg&w=128&q=75)
.jpg&w=128&q=75)


