Python Data Engineer
Python Data Engineer
Hybrid - NYC or Princeton NJ
Long Term
Required Skills:
• Minimum of 5 years’ experience in designing and building ETL workflows using Python and PySpark.
• Advanced knowledge of SQL and database design principles, with the ability to handle large-scale data volumes.
• Experience in data warehousing methodologies, dimensional modeling, and ETL best practices.
• Hands-on experience with big data technologies such as Hadoop, Hive, or similar.
• Proficiency with workflow orchestration tools like Airflow or Tidal.
• Ability to gather and interpret business requirements for technical solutions.
• Degree in Computer Science, Engineering, or a related field.
• Experience managing data systems with volumes exceeding 20TB.
• Familiarity with optimizing distributed databases, including partitioning and sharding strategies.
• Knowledge of reporting tools such as Tableau, Qlik Sense, or similar.
• Proven track record of working in agile environments with data discovery and self-service initiatives.