Data Engineer, CGIR
Job Description Summary
The Center for Guaranteed Income Research (CGIR) is an unconditional cash-transfer research center headquartered at the School of Social Policy & Practice at the University of Pennsylvania. CGIR conducts applied cash-transfer studies and pilot designs in concert with community-based organizations and government stakeholders to add to the empirical scholarship on cash, economic mobility, and poverty. CGIR is currently conducting the world's largest multi-site evaluation of nearly 40 guaranteed income pilots across the nation as part of The American Guaranteed Income Studies (AmGIS). Using a common research design and a core survey across its pilot sites, AmGIS aims to (1) facilitate comparisons across pilot sites, and (2) create a single master dataset pooling data from more than 19,000 study participants across treatment and control groups. In addition, CGIR is augmenting its original survey data with administrative data from across its diverse pilot sites.
The Data Engineer will work closely with CGIR's Biostatistician, Center Leadership, and Research Scientists to design and manage the data infrastructure for the AmGIS sites described above. This portfolio of work includes: (1) the organization and structuring of raw data; (2) the oversight of data quality assurance, cleaning, and control activities; (3) the collation of individual site-specific datasets into a single master dataset; (4) the integration of original survey data with administrative data; and (5) the security, integrity, and storage of datasets, codebooks, coding, and other associated documentation. In sum, the Data Engineer will be responsible for all tasks related to data acquisition, data transformation and cleaning, data integration, and data storage and security. The Data Engineer will also supervise student research assistants.
A bachelor's degree and 3-5 years of experience are required, preferably in computer engineering, information systems, or a related field; an advanced graduate degree is preferred. A minimum of 1-2 years of professional experience in data engineering, cloud platforms, or databases is mandatory, with demonstrated expertise in coding and programming in both SQL and Python, data warehousing, machine learning, cloud computing, data integration tools, and data security. Additional requisite skills include strong critical thinking, interpersonal, communication, and problem-solving skills, as well as supervisory experience. This is a hybrid position requiring 2-3 days per week in person on campus. The position is contingent upon grant funding.
Job Description
Job Responsibilities:
Data Infrastructure and Management:
- Develop robust data cleaning and validation pipelines with comprehensive error checking
- Create and maintain metadata standards and documentation systems for research datasets
- Collaborate with IT team on cloud infrastructure migration and data security protocols
- Design and implement systematic approaches to data quality assurance and control
- Build processes for integrating multi-site datasets into master research databases
- Establish version control and reproducible workflows for all data processing activities
Advanced Analytics and Machine Learning:
- Apply machine learning and advanced statistical techniques to research questions
- Develop predictive models and analytical frameworks for longitudinal data
- Create data visualizations
- Conduct complex data linkage and matching between survey and administrative datasets
- Support multiple research studies with timely, high-quality analytical deliverables
Research Collaboration, Supervision, and Communication:
- Collaborate with center leadership and research scientists on study design and methodology
- Translate complex technical concepts for diverse stakeholder audiences
- Contribute to grant proposals, white papers, and peer-reviewed publications
- Present findings to academic, policy, and media audiences
- Participate in research team meetings and project planning activities
- Supervise student research assistants as needed
Project Management:
- Develop and manage timelines for data cleaning
Other Duties as Assigned:
- Must support SP2 Graduation and Commencement activities
Qualifications
A Bachelor of Science and 3 to 5 years of experience, or an equivalent combination of education and experience, are required, preferably in computer engineering, information systems, or a related field; an advanced graduate degree is preferred. A minimum of 1-2 years of professional experience in data engineering, cloud platforms, or databases is mandatory, with demonstrated expertise in coding and programming in both SQL and Python, data warehousing, machine learning, cloud computing, data integration tools, and data security. Additional requisite skills include strong critical thinking, interpersonal, communication, and problem-solving skills, as well as supervisory experience. This is a hybrid position requiring 2-3 days per week in person on campus. The position is contingent upon grant funding.
Job Location - City, State
Philadelphia, Pennsylvania
Department / School
School of Social Policy and Practice
Pay Range
$70,500.00 - $95,000.00 Annual Rate