AI lipsync tool for video content creators
At sync., we're not just building video generation and manipulation models - we're changing how the world creates and interacts with video. Backed by Google, we’re on a mission to scale up our models and have them impact every human on the planet – from recording any tik tok in a single take, to watching Iron Man speak flawless Hindi on Netflix.
We’re looking for a 10x hacker who loves to engineer data.
Data is the lifeblood of bleeding edge models, and we’re looking to scale up collection, processing, and training one-hundred-fold over the next 12 months.
You’ll have the autonomy to build and scale our systems at breakneck speed, working directly w/ our founders and world class research team to maximally leverage data + compute to unlock new capabilities.
This isn't just a normal role – it's a position where you'll shape the future of a high-impact industry-defining company, reporting directly to our CTO.
- Architect, build, and scale our entire data and training infrastructure to train models that change how we live, work, and play forever.
- Design and implement robust systems for acquiring, processing, and storing massive volumes of video, audio, and text.
- Create scalable ETL pipelines that can handle petabytes of data with high efficiency and low latency
- Build systems to monitor and maintain data integrity, diversity, and quality.
- Collaborate with research and ML engineers to optimize data pipelines for model training and inference
- Make rapid, high-impact technical decisions that balance immediate needs with long-term scalability
- Drive innovation in our core AI/ML models through advanced data engineering techniques
- Founded or was an early data engineering lead at a successful AI/ML startup that scaled rapidly
- Track record of successful open-source contributions or thought leadership in the data engineering community
- Broad and deep technical knowledge spanning data engineering, ML ops, DevOps, and software engineering
- Experience working with video data and large-scale content management systems
- Significant equity stake and competitive salary
- Full autonomy over data infrastructure decisions
- Opportunity to define the future of AI-driven video creation and manipulation
- Work directly with the CTO and founders to scale the company
- Access to cutting-edge AI/ML resources and massive datasets
- Chance to work with and learn from world-class researchers and engineers
Our goal is to keep the team lean, hungry, and shipping fast.
These are the qualities we embody and look for:
We're a small team who works hard to create outsized impact.
Our interview process is grounded in reality:
You'll receive a decision within 24 hours of the onsite.
We want to set expectations – we work hard, work fast, and do a lot with very little. You'd be joining an outlier team at the ground floor, and a culture of obsession with engineering excellence is what we care about most.
We're moving fast - if you're excited about the challenge of engineering data, shaping the future of video technology, and having massive technical autonomy, we want to hear from you today.
at sync. we're making video as fluid and editable as a word document.
how much time would you save if you could record every video in a single take?
no more re-recording yourself because you didn't like what you said, or how you said it.
just shoot once, revise yourself to do exactly what you want, and post. that's all.
this is the future of video: AI modified >> AI generated
we're playing at the edge of science + fiction.
our team is young, hungry, uniquely experienced, and advised by some of the greatest research minds + startup operators in the world. we're driven to solve impossible problems, impossibly fast.
our founders are the original team behind the open sourced wav2lip — the most prolific lip-sync model to date w/ over 9k+ GitHub stars.
we’re at a stage today in computer vision where we were w/ NLP two years ago — have a bunch of disparate, specialized models (eg. Sentiment classification, translation, summarization, etc), but LLMs (a generalized large language model) displaced them.
we’re taking the same approach – curating high quality datasets + training a series of specialized models to accomplish specific tasks, while building up to towards a more generalized approach for one model to rule them all.
post batch our growth is e^x – we need help asap to scale up our infra, training, and product velocity.
we look for the following: [1] raw intelligence [2] boundless curiosity [3] exceptional resolve [4] high agency [5] outlier hustle