Please note that Coinbase no longer supports this browser. We recommend upgrading to the latest Google Chrome or Firefox.

Senior Software Engineer - Data Engineering

San Francisco, CA

Back To All Jobs

Senior Software Engineer - Data Engineering
San Francisco, CA

Our vision is to bring more innovation, efficiency, and equality of opportunity to the world by creating an open financial system. Our first step on that journey is making digital currency accessible and approachable for everyone.

Two principles guide our efforts. First, be the most trusted company in our domain. Second, create user-focused products that are easier and more delightful to use than any alternative. Those principles guide every decision across the company from design through engineering, from operations through security. One key ingredient for making informed decisions is reliable and timely access to data and that’s where you come in.

You will get the chance to build our next generation of data and machine learning pipelines and scoring systems from the ground-up. Our data pipeline moves several terabytes of data from our production database (Mongo) to analytical database (Redshift). We use machine learning to detect a variety of bad-actors on our platform including payment-fraudsters, risky users from a compliance perspective, users providing fake IDs, etc.


  • ETL pipeline: Maintain and build our next generation Extract Transform Load (ETL) pipeline. Your specific challenge would be to build this for both scale (handle 10x data) and speed (ensure 1 minute or less of lag time)
  • ML pipeline: Redesign our Machine Learning (ML) pipeline using Apache Spark
  • Deep Learning pipeline: Build a deep learning pipeline for image classification tasks like detecting fake and photoshopped IDs
  • ML scoring: Build a micro-service that allows us to get a user’s risk score in 100 msec or less


  • Exhibit our core cultural values: add positive energy, communicate clearly, be curious, and be a builder
  • Experience building at least one big-data pipeline in production
  • Deep knowledge of at least one of the following big-data databases e.g., Spark, Hadoop, Hbase, Cassandra, DynamoDB
  • Experience building micro-services

Preferred (not required):

  • Computer Science or related engineering degree
  • Experience with Machine Learning a plus, but not required

What to send

  • A resume and a link to your GitHub or blog post showcasing something awesome you've built


We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.


Apply For This Job
* = required field