Screening software developers looks easy on paper. Ask a few coding questions, skim through resumes, maybe check GitHub for proof. The real story? It’s messy, full of surprises, and gets exponentially harder when you scale from a hundred to a hundred thousand candidates.
That’s the journey we’ve lived at Lemon.io. Our team has processed more than 100,000 engineers in just a few years—each one aiming to join a talent pool where only 2% make the cut.
Along the way, we’ve discovered what most guides miss: screening isn’t just about technical questions or years of experience. It’s about building a process that catches what automation can’t, adapts as the talent market shifts, and still delivers consistent results for clients and candidates at scale.
We’ve learned the hard way where most companies trip up. We’ve overhauled our own approach several times, faced problems that didn’t exist at small scale, and watched as AI and automation tools hit their limits.
What follows is a look behind the curtain—what it really takes to build and run a developer screening system for 100,000+ candidates, and what we wish more companies understood about vetting top tech talent.
The Journey: From the First Hundred to the Hundred-Thousandth Candidate
We didn’t start with a sophisticated system. The first hundred developers we screened? It was hands-on, manual, and full of trial and error. We learned on the fly. Sometimes, that meant missing great talent or letting in people who looked perfect on paper but fell apart in real projects.
The early process leaned on intuition and a few set questions. But as candidate numbers grew, weaknesses became obvious. Patterns emerged in who failed, why projects derailed, and what clients wished we’d caught sooner. Mistakes—plenty of them—forced us to step back and rethink the basics.
By the time we reached 100,000 candidates, the process had become a living product. We’d gone through major rewrites—tossing out what didn’t work, doubling down on what caught real talent. We integrated tools, built internal CRMs, and rolled out structured scorecards for both soft and hard skills. Every change was battle-tested by mistakes: resumes that fooled us, highly skilled engineers who failed with clients, operational glitches that nearly broke our system.
The biggest lesson? Screening at scale is never “set and forget.” The more volume you handle, the more humble you get about what you don’t know—because there’s always a new surprise waiting in the next application batch.
Behind the Scenes: What Actually Makes a Great Screening System
You don’t process 100,000 engineering candidates by doing the same thing you did on day one. At each new scale milestone, our screening process broke in ways we didn’t predict. The big leaps—the “overhaul moments”—came not from best practices, but from problems you only meet when you’re deep in the work, facing the realities you can’t plan for.
Turning Growing Pains into Process
The first overhaul was about consistency. Early on, we saw that different technical interviewers had wildly different standards. A “senior” to one person was “mid-level” to another.
Some interviewers focused on deep technical knowledge; others leaned heavily on soft skills. As our pipeline exploded, we started seeing developers who breezed through one version of our interview but crashed into a wall in client projects. The fix was structured scorecards for both hard and soft skills, backed by regular calibration sessions, so every interviewer measured candidates against the same bar.
Adapting for scale and outpacing scammers
Next came a breakdown in our day-to-day process. In the early days, our feedback lived in Google Sheets and chat threads. When you’re reviewing dozens of candidates a week, that’s fine.
When you hit hundreds, it’s a nightmare—lost notes, accidental double entries, feedback that vanished if you clicked the wrong button. It wasn’t sustainable.
The fix: we built out our own internal CRM, “Holy Place,” loaded with automations, Slack integrations, reminders, and a duplicate profile detector. Suddenly, feedback stuck, and scaling felt human again.
The third crisis was trust. As Lemon.io branched into new regions and talent pools, scammers slipped in—people using fake identities, gaming resume checks, or prepping for interviews with AI-generated answers.
The old process was too slow to react. We had to introduce new steps, like short English checks before intro calls, video introductions, and automated red-flag detection for documentation. The moment we tightened the loop, scams dropped dramatically and baseline quality went up.
The hard limits of automation
Every time we’ve tried to automate too much, we’ve found the edge where only humans matter: the subtle warning signs, the nuances in communication, the real learning potential. That’s why, even today, parsing tools and bots support our work but never replace the human eye on a resume or in a live interview.
Our internal CRM—Holy Place—became a critical tool as we scaled. Holy Place brings order to what used to be scattered feedback, disconnected notes, and unpredictable follow-ups. It tracks candidate stages, integrates with Slack, sends reminders, flags duplicate profiles, and makes sure feedback doesn’t vanish. It’s the backbone that keeps the process from collapsing under its own weight. But Holy Place has a strict job description: automate the tedious, never the judgment calls.
Here’s how we draw the line:
| What We Automate | What Only Humans Can Do |
|---|---|
| Reminders & status updates | Spotting subtle behavioral red flags |
| Duplicate profile detection | Reading intent in conversation |
| Tracking candidate stages | Judging real commercial experience |
| Feedback archiving | Sensing when a resume is “too perfect” |
| Slack notifications & pings | Probing for genuine problem-solving |
Trying to automate human judgment didn’t just create false positives and misses—it nearly broke trust with clients and candidates. Our lesson: automate process, not people. Automate volume, not value.
Key Pitfalls Most Companies Fall Into
Every company thinks they’ve found the formula for screening technical talent—until reality proves otherwise. We’ve made most of these mistakes ourselves, and we see them repeated week after week across the industry. Here are the main traps that surface again and again when vetting developers, no matter how advanced your tools or how big your volume and ambition.
Where most screening efforts fall short:
| Pitfall | What It Looks Like in Practice | Why It Fails |
|---|---|---|
| Trusting resumes at face value | Taking big brands or polished GitHub as proof of skill | Credentials ≠ actual performance |
| Automating the work that matters | Relying on algorithms or tests to evaluate all candidates | Misses communication, intent, context |
| Ignoring human signals | Skipping live interviews or not probing soft skills | Misses team fit and real-world gaps |
| Underestimating fraud | Accepting documents/tests at face value, not layering checks | Scammers adapt—single filters break |
| Treating vetting as “set and forget” | Building a system once, never revisiting how/why it works | New patterns, new risks every cycle |
Believing a Resume Tells the Whole Story
It’s easy to trust a slick resume, a big-name employer, or a well-curated GitHub. We’ve learned that keywords and credentials don’t predict performance. Some of our worst mismatches looked perfect on paper. What matters: proof of impact, not a polished summary.
Automating Away the Work That Matters
Tools and automation are tempting—especially at scale. But we’ve seen the limits firsthand. Give too much control to algorithms, and you miss what actually matters: how a candidate explains decisions, handles setbacks, or communicates uncertainty. Automated tests can’t spot these traits.
Missing the Human Signals
Soft skills are never obvious from a code sample. Communication gaps, lack of proactivity, or cultural misfit show up only in real conversations. Early on, we saw this in developers who could pass every test but struggled when projects hit rough water. That’s why human interviews—live and unscripted—are non-negotiable.
Underestimating the Problem of Fraud
Fraudsters aren’t rare; they’re persistent. As we scaled, scammers got more creative—fake documents, well-rehearsed answers, sometimes even borrowing someone else’s career history. We’ve been burned. That’s why layered checks are standard, and why we never rely on a single step to flag risk.
Thinking the Process Is “One and Done”
Screening isn’t static. The moment you think you’ve “solved” it, a new pattern slips through. Every overhaul, every painful miss, forced us to keep adjusting—calibrating what matters, tightening up steps, and building a process that reflects what actually works, not just what’s convenient.
Lemon.io’s Approach: Lessons Learned, the Road Ahead
Today’s pipeline doesn’t look much like the one we started with, but it’s not a monument—it’s a live system that changes as fast as the world outside it. We’ve learned to treat every stage as temporary, every standard as open to debate, and every piece of feedback as a prompt to get better.
Interviewer quality is just as important as candidate quality—so we invest in regular training and ongoing assessment calibration for our recruiters and technical interviewers. Everyone on our team is expected to deepen their skills, and we constantly review how interviewers apply our scorecards in practice.
Alongside the process tweaks, we added a small but powerful checkpoint: before the soft skills interview, every developer watches a short Lemon.io intro video. It explains our business model, process, and payment methods—so expectations are set and the conversation can go deeper, faster. It’s a simple way to screen out misaligned candidates before either side invests more time.
What we do now, and why it works:
| Practice | Why We Do It |
|---|---|
| Detailed scorecards for every technical interview | Enforces shared standards and fewer weak passes |
| Frequent calibration sessions | Keeps all interviewers aligned, removes ambiguity |
| Auto-flagging and reminders via Holy Place | No falling through the cracks at scale |
| Layered checks for identity, soft skills, and docs | Stops scammers and resume “pattern-fitters” |
| Feedback loops with clients and devs | Turn field results into pipeline improvements |
| Regular process reviews and rewrites | Screening evolves as fast as the market |
We’re currently developing new internal tooling to help compare technical interview results and make developer strengths clearer for clients. The goal is a scorecard that’s easy for clients to interpret and that highlights the signals that matter most for each role.
What does this look like in reality? Suppose a client flags a developer as a perfect technical fit, but communication falters mid-project. We’ll revisit the full assessment:
- Was something missed in the soft-skills interview?
- Did our calibration slip?
That root cause—whether it’s a step skipped or a signal misunderstood—will spark a rewrite of how we interview, an update to Holy Place, or retraining for the team.
The result: Every pain point or surprise becomes the fuel for the next overhaul. That’s how we make the vetting sharper—and why no single step or tool is sacred if there’s a better way to serve clients and devs.
Why This Matters (and How It Makes Your Next Hire Better)
Let’s get honest: nobody reads a thousand words about hiring because they love process diagrams. You care about results—shipping better software, working with people you trust, and avoiding the headaches that come with bad hires.
That’s where all this relentless screening, scoring, and tweaking pays off.
For developers, it’s a shot to join teams where they’re respected for more than just a stacked resume—where expectations are clear, fraudsters are filtered out, and good work actually gets noticed.
The right vetting process saves you time, money, and stress on both sides of the table. It’s not about passing tests—it’s about making work better, faster, and a whole lot more human.
Want to work with truly vetted developers? Start your next project with Lemon.io today.