The vast majority of enterprise data — from financial statements to health records — are locked in unstructured file formats like PDFs and spreadsheets. Reducto is the most accurate way to parse and extract data from complex documents.
Today we power ingestion pipelines for hundreds of leading AI teams, ranging from popular startups to Fortune 10 enterprises. We’ve grown incredibly quickly (0→7 fig in ARR in 6 months), are loved by customers (>300M pages parsed), and are well funded by tier 1 investors.
As a member of our founding team you’ll work on our core API and on prem deployments. That means you’ll have a hand in everything that our customers need.
Philosophy: You are your own worst critic. You have a high bar for quality and don’t rest until the job is done right—no settling for 90%. We want someone who ships fast, with high agency, and who doesn't just voice problems but actively jumps in to fix them.
Experience: You have 2 to 5 years of experience with training, fine tuning, and evaluating ML models used in production systems
Language/Skills: You’re exceptional at Python or similar, and are well versed with both traditional computer vision and VLMs
Tools: Build your own tools as needed—like a quick Streamlit app to test hypotheses or create a dataset.
Approach: A quantitative approach to building products. Ability to debug, experiment, and iterate fast. You should be comfortable getting hands-on with the full development lifecycle, from ideation to shipping to users.
Training and deploying new state of the art models for parsing and interpreting unstructured data
Experimenting with novel techniques to improve LLM accuracy
Build data pipelines, evaluate model performance, and integrate models into the product
Working directly with the founders and customers to shape the product direction and engineering strategy
Have prior experience founding a company or building products at early stages
Are ambitious and driven, and care a lot about doing great work with great people
Keep up with the latest developments in ML/AI
This is an in person role at our office in SF. We’re an early stage company which means that the role requires working hard and moving quickly. Please only apply if that excites you.
Nearly 80% of enterprise data is in unstructured formats like PDFs
PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week.
Traditional approaches fail at reliably extracting information in complex PDFs
OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns are jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.
Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:
Accurately extract text and tables even with nonstandard layouts
Automatically convert graphs to tabular data and summarize images in documents
Extract important fields from complex forms with simple, natural language instructions
Build powerful retrieval pipelines using Reducto’s document metadata
Intelligently chunk information using the document’s layout data
At Reducto, we’re invested in the well-being and growth of our team. Here’s what we currently offer:
Unlimited PTO: We believe great work requires recharging.
Lunch: Receive a free lunch to eat with your teammates daily at the office
Reimbursed Transportation: Provide us with your receipts and we’ll take care of the costs
Insurance:
Health: Medical, dental, and vision.
Financial Security: Short-term and long-term disability, life insurance, and voluntary life insurance.
Extra Support: Accident, hospital, and critical illness insurance.
Health and Wellness Budget: We provide up to $150/mo reimbursement for health and wellness spending, such as gym memberships, fitness classes, or similar.
Parental Leave: Work with us to build a leave schedule that works for you and your family
Reducto is an Equal Opportunity Employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration for employment without regard to sex, race, color, age, national origin, religion, physical and mental disability, genetic information, marital status, sexual orientation, gender identity/assignment, citizenship, pregnancy or maternity, protected veteran status, or any other status prohibited by applicable national, federal, state or local law.