Summary of Position
As a critical part of the Transparency and Insights (T&I) team at CSE, this Data Scientist role is dedicated to enhancing the organization's capabilities in program analysis, evaluation, and implementation through data-driven insights. The individual will play a key role in managing, analyzing, and interpreting data, and will contribute to the development of novel models for projects within the T&I department.
Core Responsibilities
Data Analysis and Insights Generation
- Assemble, manipulate, analyze, and interpret data from diverse sources, potentially collaborating with external research partners.
- Apply statistical models and machine learning techniques to derive insights from data.
- Ensure the quality and accuracy of analyses by implementing quality control (QC) measures.
- Document analytical methods and contribute to the development of standard operating procedures (SOPs).
- Assist in report automation efforts, focusing on text and figure generation for internal and external clients.
Data Visualization and Application Support
- Develop and maintain web-based data visualizations, interactive maps, and reporting tools to communicate insights effectively.
- Facilitate the extraction and reporting of data from PostgreSQL databases, flat files, and other data sources.
- Work under the mentorship of senior T&I team members to meet project objectives.
Core Knowledge and Skills
Technical Proficiency
- Experience with generative AI APIs and proficiency in prompt engineering for applications such as data extraction and content generation.
- Proficient in Python for data and statistical analysis, using libraries such as pandas, matplotlib, and scikit-learn.
- Knowledgeable in object-oriented programming and ETL pipeline development.
- Skilled in geospatial analysis, including the use of libraries such as GeoPandas and Shapely.
- Experience with standard geospatial data formats (e.g., shapefiles), spatial databases (e.g., PostGIS), and GIS software (e.g., QGIS, ArcGIS).
- Experienced with software version control using Git.
- Strong skills in Microsoft Excel, Word, and PowerPoint.
- Experience with data visualization tools (e.g., Tableau, Plotly, QuickSight) and survey platforms (e.g., Alchemer).
Cloud Computing
- Basic knowledge of AWS services such as IAM, EC2, and RDS.
Soft Skills
- Exceptional attention to detail, along with strong organizational and problem-solving abilities.
- Excellent communication skills, both oral and written.
- Self-motivated with the ability to manage deadlines for multiple concurrent projects, using project management tools effectively.
- Proactive in communicating with team members in a remote environment and open to seeking and receiving feedback.
Special Interests
- A keen interest in data science applications within the research and sustainable energy sectors.
Preferred Skills
- A foundational understanding of the energy sector, including electric vehicles, energy efficiency, distributed generation, or energy storage.
Required Education and Experience
- 1-3 years of experience in data analysis or a related field.
- Bachelor’s degree in a field with substantial analytical content, such as statistics, economics, computer science, social science, environmental science, or engineering.
Additional Information
Occasional evening and weekend work may be required to meet project deadlines.