1 of 5 free roles viewed today. Upgrade to premium for unlimited from only $19.99 with a 2-day free trial.

Lead Data Engineer_San Francisco, CA_Only on W2_No C2C/1099

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Lead Data Engineer in San Francisco, CA, with a contract length of "unknown" and a pay rate of "unknown." Key skills include Python, Spark, Java, and experience with Databricks. Requires 8-10 years in data processing and ML model deployment.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date discovered

April 3, 2025

🕒 - Project duration

Unknown

🏝️ - Location type

On-site

📄 - Contract type

1099 Contractor

🔒 - Security clearance

Unknown

📍 - Location detailed

San Francisco, CA

🧠 - Skills detailed

#NoSQL #"ETL (Extract #Transform #Load)" #Redis #Transformers #Data Lake #Big Data #Data Processing #Delta Lake #Spark (Apache Spark) #Databricks #Keras #TensorFlow #Batch #Data Accuracy #Libraries #Python #Data Warehouse #PyTorch #Databases #Leadership #Deep Learning #Java #Data Pipeline #Spring Boot #SQL (Structured Query Language) #Data Quality #ML (Machine Learning) #Requirements Gathering #Jupyter #Kafka (Apache Kafka) #Data Engineering

Role description

You've reached your limit of 5 free role views today.
Upgrade to premium for unlimited access - from only $19.99.

Responsibilities

Development Tasks:

• Collect metrics based on user interactions.

• Visualize data for business teams.

• Develop and redesign data pipelines using Kafka streams.

• Implement solutions using Spring Boot Java and Databricks Spark streaming.

Leadership Duties

• Lead the measurement processes from requirements gathering to production delivery.

• Collaborate with other team leads, business partners, and product managers.

• Balance between hands-on engineering (50%) and team leadership (50%).

Collaboration Structure

• Onsite: Lead role (this resource)

• Nearshore: Senior developer.

• Offshore: Data engineer role.

Lead Data Engineer - Job Description

Required Skills & Experience:

• Hands-on code mindset with deep understanding in technologies / skillset and an ability to understand larger picture.

• Sound knowledge to understand Architectural Patterns, best practices and Non-Functional Requirements

• Overall, 8-10 years of experience in heavy volume data processing, data platform, data lake, big data, data warehouse, or equivalent.

• 5+ years of experience with strong proficiency in Python and Spark (must-have).

• 3+ years of hands-on experience in ETL workflows using Spark and Python.

• 4+ years of experience with large-scale data loads, feature extraction, and data processing pipelines in different modes – near real time, batch, realtime.

• Solid understanding of data quality, data accuracy concepts and practices.

• 3+ years of solid experience in building and deploying ML models in a production setup. Ability to quickly adapt and take care of data preprocessing, feature engineering, model engineering as needed.

• Preferred: Experience working with Python deep learning libraries like any or more than one of these - PyTorch, Tensorflow, Keras or equivalent.

• Preferred: Prior experience working with LLMs, transformers. Must be able to work through all phases of the model development as needed.

• Experience integrating with various data stores, including:

• SQL/NoSQL databases

• In-memory stores like Redis

• Data lakes (e.g., Delta Lake)

• Experience with Kafka streams, producers & consumers.

• Required: Experience with Databricks or similar data lake / data platform.

• Required: Java and Spring Boot experience with respect to data processing - near real time, batch based.

• Familiarity with notebook-based environments such as Jupyter Notebook.

• Adaptability: Must be open to learning new technologies and approaches.

• Initiative: Ability to take ownership of tasks, learn independently, and innovate.

• With technology landscape changing rapidly, ability and willingness to learn new technologies as needed and produce results on job.

Preferred Skills:

• Ability to pivot from conventional approaches and develop creative solutions.