Evidence-Based Hiring
Evidence-Based Hiring
Zeljko Svedic
License
https://creativecommons.org/licenses/by-nd/4.0/legalco
de
1
CHAPTER 6: The New Model in Detail . . . . . . . . . 53
Everything We’ve Learned: Validity, Reward
vs. Effort, Good Questions . . . . . . . . . 53
The Job Ad . . . . . . . . . . . . . . . . . . . . . . . 55
The Application Form . . . . . . . . . . . . . . . . 59
Screening Résumés . . . . . . . . . . . . . . . . . 64
Communicating With Candidates . . . . . . . . . 65
Detailed Screening Test . . . . . . . . . . . . . . . 67
Structured Interviews . . . . . . . . . . . . . . . . 69
Probing Motivation and Character . . . . . . . . 72
Second Opinions . . . . . . . . . . . . . . . . . . . 78
Negotiating the Contract . . . . . . . . . . . . . . 80
Wrapping It Up . . . . . . . . . . . . . . . . . . . . . 81
CHAPTER 7: We’re Not Done Yet: Continuous Im-
provement . . . . . . . . . . . . . . . . . . . . . . . 83
CHAPTER 8: Conclusion . . . . . . . . . . . . . . . . . . 87
BONUS CHAPTER: Screening Software Developers . 90
How Testing Platforms Work . . . . . . . . . . . . 95
Conclusion . . . . . . . . . . . . . . . . . . . . . . . 100
One more thing. . . . . . . . . . . . . . . . . . . . . . . . 102
About the Author . . . . . . . . . . . . . . . . . . . . . . 103
2
CHAPTER 1: The Failure of
Traditional Employment
Screening
The traditional hiring process is broken. It’s a throwback
to an industrial era hundreds of years ago, when we didn’t
have employment science and relied on our hunches in-
stead of data. If you’re reading this book, you probably
feel the same way.
My First Hire
Many of us make the same mistakes when first hiring.
I remember posting my first job ad: I was proud, as it
was a sign that my little company was becoming serious.
Customers were loving my product and I needed a software
developer to help me out. It was obviously going to be the
first hire of the many thousands that would follow, putting
me on a path to me becoming the new Steve Jobs.
3
XML to algorithms. They had more experience than me,
and they’d worked on cool projects at previous companies.
I started to expand my future company vision. Was I think-
ing too small? Maybe I could also do a hostile takeover of
Microsoft and Amazon?
The first candidate arrived and I sat him down with a glass
of orange juice. We talked about his résumé and work
experience. I explained what the job was, he confirmed
that he was a very good fit for it. Great. We began talking
about IT in general. I enjoyed the conversation, so I didn’t
want to derail it by asking questions that may break the
rapport that we were developing. At the same time, a
nagging voice in my head told me that I needed to test the
candidate, not just chat with him.
4
job required. To my surprise, he clammed up, struggling to
answer even half the questions adequately. The interview
swerved off the rails, crashing into a ravine of my mis-
placed optimism and his bad bluffing. He was not qualified
for this job.
5
The Bell Curve
When I look back at those days, a decade ago, I can’t
help but break out into a smile. My simple mistakes are
funny in retrospect, because most first-time interviewers
make them. My biggest mistake was not understanding
the hiring bell curve.
6
This distribution is used as a basis for scoring in every
school, country, and domain of knowledge. While you
would think that everybody who finishes medicine or
physics is equally good, as those professions attract top
performers, when you plot results of medicine or physics
students, you also get the bell curve. Yes, they know
much more about medicine or physics than the average
person, but, inside the domain, there are still enormous
differences between top and low performers.
7
evenly among people or that we’re not all equal. But, this
inequality is a reality that academia and businesses must
accept.
8
let them go rather than offering them a pay raise to stay?
Hiring is a case of asymmetric information—the candidate
and their previous employers have much more information
than you can extract in a one-hour interview.
9
have an insane number of applicants. In 2014, 3 million
people applied to Google2 , and only 1-in-428 of them were
hired. There is no way to screen a population which is
the size of Mongolia without resorting to some type of
automated screening. But, Google is still a small company.
If Walmart had a 1-in-100 selection criteria, they would
need to screen the entire working population of the US
and still find themselves 25 million people short3 . Before
I had experience in hiring, I would often get angry with
incompetent customer support representatives, delivery
staff, or salespeople. How could a company hire them
when they’re so incompetent? Now, I know better. I
imagine myself being a head of a department hiring 500
people in a new city—for a below-average salary. Large
enterprises hire on such a massive scale that they’re more
focused on screening out the bottom 20% than chasing
the hallowed top ten on the illusive far right of the bell
curve.
This might sound like a bad thing, but it’s not, as it means
a large majority of society can find employment. Hiring
knowledge workers is not hiring on a massive scale. You
can screen for the best and give them better benefits than
the competition. A great employee that delivers twice as
much is well worth a 50% higher salary.
2 Quartz article: https://qz.com/285001/heres-why-you-only-have-a-0-
2-chance-of-getting-hired-at-google/
3Wikipedia: https://en.wikipedia.org/wiki/List_of_largest_United_States-
based_employers_globally
10
How Can We Fix Employment
Screening?
For all the reasons discussed, the average hiring process
just doesn’t cut it. The following chapters will teach you a
new method called Evidence-Based Hiring. You will learn:
11
• Training and development: how to keep employees
skilled, happy, and motivated.
12
CHAPTER 2: The Current
Model Doesn’t Work Because
We’re All Biased
Before we can fix what’s broken, we first need to under-
stand why it’s broken and who broke it.
13
The answer is simple: the surgeon is the boy’s mother.
Were you able to solve it, and how many seconds did it
take you to find the answer?
You might think that you don’t have a gender or race bias,
but even people in an in-group can be biased against its
other members. Psychologists Kenneth and Mamie Clark
created “the doll tests”, where kids are given a choice of a
few dolls with different skin tones5 . African-American kids
overwhelmingly choose white dolls, as they “are prettier”
and “better.”
14
Screening by Proxy
When hiring, as we already discussed, we have a lack of
information, and so we are likely to use proxies to make a
decision:
15
feel right. A hidden part of my brain is telling me it has
never seen a surgeon like this. Situations that are familiar
make us relaxed, while situations that are unfamiliar make
us stressed and uncertain. Meanwhile, another part of
my brain is evaluating my social situation—what will my
coworkers say if I pick such a strange candidate? Is it worth
the risk?
man-s-true-character-by-the-way
10Wikimedia: https://commons.wikimedia.org/wiki/File:Bundesarchiv
_B_145_Bild-F051673-0059,_Adolf_Hitler_und_Eva_Braun_auf_dem_Ber
ghof.jpg
11Wikipedia: https://en.wikipedia.org/wiki/Animal_welfare_in_Nazi_Ger
many
16
Adolf Hitler, Eva Braun, and their dogs.9
17
etarian12 and was planning to ban slaughterhouses after
WWII because of animal cruelty. He was also a non-smoker
and non-drinker. Hitler’s mistress, Eva Braun, loved him
so much that she decided to share his faith. They got mar-
ried in his underground bunker and, 40 hours later, they
committed suicide together13 .
So, who gets votes? Well, all American presidents for the
past 130 years, except one, had owned a dog14 . All but
two of them were married15 . All but two were Christians16 .
All, except one, were white. No matter what people say
they look for in a head of state, an older, white, religious
male with a wife and a dog fits a stereotype of a good
president.
uicide
14Wikipedia: https://en.wikipedia.org/wiki/United_States_presidential
_pets
15 Quora: https://www.quora.com/Has-there-ever-been-a-U-S-president-
who-was-not-married
16Wikipedia: https://en.wikipedia.org/wiki/Religious_affiliations_of_Pre
sidents_of_the_United_States
17 Book by Walter Isaacson: Einstein: His Life and Universe
18
professor of physics refused because Einstein called his
lectures “outdated” and questioned why he didn’t teach
modern theories. As a result, Einstein was the only one
in his class of students who couldn’t find employment in
his field. He was forced to work at a patent office, and
only managed to get an academic job a few years after
he published his special theory of relativity. If the main
criterion for examination had been creating original physics
papers, or if the testing of students was blind, it’s likely a
young Albert could have shown what he had to offer. In
the case of Einstein, his professors used a “did the student
attend my lectures” proxy and a “did the student like my
course” proxy.
19
2. The telephone interview - here, we tend to ask candi-
dates very simple questions which are related to their
previous experience and résumé. Delivered this way,
it is not a knowledge or work test. Instead, it’s more
of a short communication skills test that extroverts
will ace. Even if the job requires communication skills,
this doesn’t mean a phone call is a good proxy for
work-specific communication.
18 Paper:
https://www.researchgate.net/publication/313878823_The_i
mportance_of_first_impressions_in_a_job_interview
20
CHAPTER 3: Not All Methods
Are Equal
Since we’re all biased and we use incorrect proxies, why not
just outsource hiring to experts or recruitment agencies?
After all, they’ve been screening people for many years, so
they must know how to do it right?
21
we wouldn’t think of only hiring Libras because they are
supposedly curious and outgoing.
US/Products%20and%20Services/Myers-Briggs
22 16personalities: https://www.16personalities.com/personality-types
23 Psychometric Success: http://www.psychometric-success.com/person
ality-tests/personality-tests-popular-tests.htm
24 Article: http://www.indiana.edu/~jobtalk/Articles/develop/mbti.pdf
25 Article: https://www.forbes.com/sites/toddessig/2014/09/29/the-
mysterious-popularity-of-the-meaningless-myers-briggs-mbti/
26 Article: https://op-talk.blogs.nytimes.com/2014/07/18/why-myers-
briggs-is-totally-useless-but-wildly-popular/
27 RMi: https://www.rmiexecutivesearch.com/disc-assessment
28Wikipedia: https://en.wikipedia.org/wiki/Ipsative
29 People Success: http://www.peoplesuccess.co.uk/behavioural-style/va
lidity/
22
was just face validity30 .
263c1a4d0e1e1119668563274d242e.pdf
23
correlated.
24
Validity of different selection methods.
psychoanalysis-of-job-hiring
36 Wikipedia: https://en.wikipedia.org/wiki/Graphology
37 IPAC presentation: http://www.ipacweb.org/Resources/Documents/co
nf13/white.pdf
38 Section from OPM.gov: https://tldrify.com/pgu
25
Method Validity Description
Job experience (years) 0.18 Years of experience in the job correlates only slightly with the job
performance, and, thus, should not be used. It’s often a deciding
factor since it’s easy to determine from a candidate’s résumé and,
somehow, feels, intuitively, like it should be valid—it isn’t.
26
Method Validity Description
Job knowledge tests 0.48 Question the specific professional knowledge required for the job.
This method has high validity, but can pose a problem when
screening junior candidates who require training.
Peer rating 0.49 Asking coworkers to evaluate a candidate’s performance and
averaging the results is surprisingly valid. People inside a
company have a better insight of each other’s abilities. A good
method for in-company promotion or reassignment, but not for
hiring outside employees.
General mental ability 0.51 GMA tests measure the ability of a candidate to solve generic
(GMA) tests problems, such as numerical reasoning, verbal reasoning, or
problem solving. They don’t guarantee that a candidate has the
required skills, just the mental capability to develop them if
trained. Note that brainteasers like “how would you move mount
Fuji?” are too vague and subjective for a GMA test.
Employment interviews 0.51 The same interviewer asks different candidates identical
(structured) questions, in the same order, writes down the answers (or, even
better, records the entire session), and gives marks to each
answer. This way, different candidates can be transparently
compared using the same criteria.
Work-sample tests 0.54 To test if a candidate will be good at work, give them a sample of
actual work to do. A simple and very effective idea.
27
CHAPTER 4: Reward vs Effort
The Last Question
So, we need to use multiple methods. But, in which order?
Say that you apply for a Sales Manager job at the Acme Cor-
poration. First, they conduct a phone interview. Then, they
call you for a face-to-face interview. After that, you visit
for a whole day of testing: aptitude tests, situational tests,
personality tests, the whole shebang. The Friday after that,
you have an on-site interview with your prospective man-
ager. Ten days later, you have an evening interview with
his manager. More than a month has passed since you
applied, and you’re five rounds in. Annoying, since you’re
currently employed and need to make excuses to go for
each interview or test. Finally, they send you an email
stating they want to make you an offer. Again, you come
to the shiny Acme Corporation office for a final talk with
both managers. They offer you a salary you are satisfied
with—great. You wipe the sweat from your forehead and
relax into your chair.
28
them, just because they forgot to ask one question.
How you order your screening process makes all the differ-
ence.
29
Spanish fluency gives less information than an on-site in-
terview, but it requires only a one-minute phone call to
know for sure whether you can reject them. It is vastly
more efficient.
31
number of candidates. If you have 20 candidates and an
average applicant takes five hours to screen, failing after
the knowledge test, the total effort is a huge 100 hours.
32
is the candidate themself, and the method is candidate self-
selection. Candidates don’t want to be failures: they want
to succeed in their job. If your job description indicates that
they are probably going to fail if they were hired, they’re
not going to apply.
33
Sounds obvious, but I used to make the mistake of leaving
out one crucial part of the job description, just because
other companies omitted it too—the salary range. While I
knew exactly what salary range my company could afford,
salary negotiations are difficult and awkward, so I left them
for the end. As a result I spent hours and hours interviewing
some senior candidates, only to discover that—aside from
a unforeseen lottery win—I could not afford them. I wasted
their and my time.
34
within it. So, the next chapter is dedicated solely to the art
of asking good questions.
35
CHAPTER 5: Maximizing the
Reward: How to Ask Good
Questions?
The general rule for screening questions in both tests
and interviews is simple, but very powerful once you’ve
wrapped your head around it:
36
Great Questions Cheat Sheet:
Bloom’s Taxonomy
Fortunately, there’s one high-level concept that you can
learn in 15 minutes, which will make you much better at
asking elimination questions.
37
Bloom’s revised taxonomy.
38
• Creating: write a poem, essay, or fictional story.
Level 1: Remembering
Let’s suppose you are hiring an astronomer. You could test
basic knowledge with the following question:
39
How many planets are in the solar system?
• 7
• 8 (correct)
• 9
• 10
• 11
This is not only a problem for online tests: it’s also trivial
to cheat in a supervised classroom setting. All you need is
a smartphone in a bag, a mini Bluetooth earphone bud41 ,
and a little bit of hair to cover your ear. It doesn’t take a
41 Bluetooth Earbud on Amazon: http://a.co/fxz9Wbm
40
Google answering question from Level 1.
For some reason, many employers and even some big test-
ing companies love online tests with trivia questions, such
as “What is X?” They think having copy-paste protection
and a time limit will prevent cheating. In my experience,
they merely test how fast a candidate can switch to an-
other tab and retype the question into Google.
Level 2: Understanding
How can you improve the Pluto question? You could re-
frame it to require understanding, not remembering. For
example, this is better:
41
Why is Pluto no longer classified as a planet?
• It is too small.
• It is too small.
• It is too far.
Level 3: Applying
Let’s presume that our imaginary candidates are applying
to work at an astronomer’s summer camp. They need to
42 As of 2017, the first answer on Google was: https://www.npr.org/temp
lates/story/story.php?storyId=5653191
43
organize star-gazing workshops, so the job requirement is
to know the constellations of the night sky.
• A
• B
• C
• D
• E
• F (correct)
44
This question requires candidates to recognize the Big Dip-
per43 and Little Dipper44 constellations, which helps to
locate Polaris (aka The North Star). The picture45 shows
the real night sky with thousands of stars of different in-
tensities, which is a tough problem for a person without
experience.
Level 4: Analyzing
Unlike the apply level, which is straightforward, analyzing
requires test-takers to consider a problem from different
angles and apply multiple concepts to reach its solution.
45
maximum star wobbling velocity of ±1 m/s. If
we presume that this wobbling is caused by a
perfectly round orbit of a single gas giant, and
this gas giant’s orbit plane lies in a line of sight,
then calculate the mass of this gas giant.
• 8.52 x 1026
• 9.01 x 1027
• 3.89 x 1027
• 7.64 x 1028
The above text box contains HTML with four errors and
a Run button which provides feedback to the candidate
on the debugging progress. As fixing invalid code is the
bread and butter of front-end development, this kind of
task easily filters out candidates with no HTML experience.
17629
46
Online Inspector question.47
Level 5: Evaluating
The next level, evaluating, demands more than merely
analyzing or applying knowledge. To be able to critique
literature or architecture, you need to have a vast knowl-
edge of the subject to draw from. Typical questions in the
evaluating category might be:
47
• How would you prioritize Z?
Level 6: Creating
Creating is the king of all levels. To create something
new, one needs not only to remember, understand, apply,
analyze, and evaluate, but also to have the extra spark of
creativity to actually make something new. After all, it’s
50 Article: http://www.nwlink.com/~donclark/hrd/bloom.html#revised
48
this key skill that separates knowledge workers from other
workers.
49
question which screens data scientists, by asking them to
create a single function in Python, see the next figure.
eting-costs/11855
50
an infinite number of solutions to an architectural problem.
Yet, you will probably be surprised to learn that, since 1997,
certification for architects in the United States and Canada
is a completely automated computer test. The ARE 4.0
version of that test53 contains 555 multiple-choice ques-
tions and 11 “vignettes.” A vignette is actually a full-blown
simulation program where the aspiring architect needs to
solve a problem by drawing elements. For example, the
Structural Systems Vignette from the next figure asks a
candidate to complete the structural framing for a roof,
using the materials listed in the program.
mination#ARE_4.0
51
chance for examiner bias, nepotism, or plain corruption in
the examination process.
52
CHAPTER 6: The New Model
in Detail
Now, we have all the building blocks we need. Let’s do a
quick recap and then explain our new screening process
step by step.
• Work-sample tests.
53
the criteria to exclude a candidate, it is not relevant for
screening. All steps together form a funnel where the most
effective methods start at the top of the funnel, see the
next figure.
Hiring funnel.
The Job Ad
Take a look at this sample job ad:
56
If you take a look at the Rockstar Developer job ad again, it
fails to mention many things that Joe and other candidates
would find relevant:
• Work schedule.
• Overtime work.
57
Our office is downtown and parking is in the same building.
We are a team of 15, who are mostly younger, informal,
and very diverse. Most of the people in the office work
from 10am until 7pm, with a long lunch break. We often
have beers or dinner after work. Because of deadlines,
we sometimes work on weekends, but compensate with
overtime pay or extra vacation days. You are expected
to travel at least two times a year, to conferences in Las
Vegas and New York.
58
Emma found this job to be a perfect match. She is young
and single, often going out and sleeping late into the morn-
ing. Hanging out with coworkers is really important to her.
The salary is in her expected range, and she would even
get equity. She doesn’t mind working some weekends, and
she can use the extra vacation days for her next trip to
Thailand.
If you don’t know what criteria to put in the job ad, think
of the most common reasons why your candidates fail at
interviews. Probably, you can add text to the job ad that
would have deterred them from applying in the first place.
59
in Chapter 2: Proxies and Biases in Hiring, most people
have a polished résumé, full of buzzwords and examples
of how they’ve changed the world. Switching to an appli-
cation form makes a huge difference.
Why?
g/profit-per-employee/10960
60
Online Profit per Employee question54 .
61
To our great surprise, so many people failed this applying
question that we have classified it as hard in our testing
system. Don’t get me wrong, I don’t think candidates
who fail this question are bad workers, but, between two
candidates, I prefer to hire a person who doesn’t need help
interpreting data.
header/11688
62
Again, as with the Job Ad, if you’re not sure what questions
to ask in your application form, think of basic questions
which candidates fail in your interviews.
Screening Résumés
The next step is reading résumés. Résumés should be
read as what they are, a marketing brochure. We need
to wade through the hyperbole haystack in search of the
fact-needles that we require. To do so in a structured way,
first, make a list of relevant requirements before reading
any résumé. This will be easy, since these are already
stated in the job ad. For example, if you are searching for
a marketing manager, the requirements might be:
64
less obvious, however, that you shouldn’t give positive
points for the same things. This is called “positive discrimi-
nation” and some companies use it in an effort to increase
diversification. Mathematically, giving two positive points
to a certain ethnicity is equal to giving two negative points
to all other ethnicities. It’s just negative discrimination in
disguise.
65
let good candidates know that:
67
erarchy level that is required for the job. If you make the
level too low, you’ll end up with employees who can only
do the basic tasks. If you make the level too high, they will
be overqualified and bored.
Accountant “File the following ten invoices into the accounting system. If the invoice is missing
some details, select what is missing in the More information needed dropdown.”
Customer Analytics “Given the following Excel pivot table, find the count and total of requested customer
refunds in Germany for December which were not paid out by the end of that month.”
Factory Product Manager “Calculate cost variance for a project with the starting budget X (spread equally over
duration) and expected duration Y, if you are currently A months into the project and
the money spent so far is B.”
Translator “Below is the example of an original and translated text. Unfortunately, the translated
text has seven words that are mistranslated. Select each of them and provide a word
that fits the context.”
Programmer “Change the following HTML code, so the page formats correctly for screens with a
width of 480px.”
Journalist “Given the following facts, write a title and a lead paragraph of a maximum of 35
words.”
Customer Service Agent “Below is an email from an angry customer whose delivery failed to show up on time.
Our system indicates that the delivery was late because of road traffic. Write your
response to the customer.”
It’s both not fair and not necessary to give applicants long
work-sample tests, such as asking a translator to translate
a few pages from a book or asking a programmer to create
a simple app. Although this approach is popular, it takes up
too much of candidate’s time, they can feel they are doing
free labor, and their creations are difficult to evaluate.
Structured Interviews
With the detailed screening test completed, we have now
eliminated most of the candidates. We only need to in-
terview the few who remain. Interviews come in a few
forms; phone, video call, and on-site interview. No matter
which medium of communication is used, the problem is
the same. How do you objectively compare candidates?
69
• The company keeps records of all interviews for future
analysis.
All the tools that you need for structured interviews are
available free of charge. We use Google Docs for questions
and scoring, as well Skype and Call Recorder58 for calls.
Our interviews look like in the next screenshot.
70
Question Scoring criteria Answers and marks (1-5)
Tell me about a time when you had Wrong: Pushing your idea onto
to make someone enthusiastic others.
about your idea or problem so that
they would want to help you? Correct: Having empathy to
understand the motivations of
others and present them something
in a way that they will like.
Imagine that you are managing an Wrong: Threatening them.
expert who repeatedly misses
deadlines. How would you handle Correct: Following up a couple of
this? times to understand the obstacle
that’s preventing them doing their
job. If it repeats three or four times,
find a replacement contractor.
Imagine that your job changes Wrong: Working harder or hiring a
overnight, going from managing subordinate to assist you.
five experts to 20. How would your
supervision style change? Correct: Move from a personal to a
more systematic
approach—documenting, tracking,
and measuring everything, using
reminders.
72
health problems and family issues. Still, he was smart and
I really wanted to hire him. He assured me that he could
balance his remaining exams with work. Just to be on the
safe side, we agreed on a four-day workweek, and signed
the employment contract.
This saga continued for months until, one day, his manager
came to me. “I think we have a problem,” he said. “Mike’s
been logging normal 9-to-5 working hours but has not been
on Skype which made me suspicious. So, I checked the
commit logs. Mike’s been committing a little bit of code
every day at 11PM but not working at all during the day.”
Mike had pulled the four-hour workweek59 trick on us.
Anywhere/dp/0307465357
60Wikipedia: https://en.wikipedia.org/wiki/Big_Five_personality_traits
73
conscientious, and diligent,” “committed to his job,” and,
“when he starts something, he finishes it”.
Accepting all this, how can you probe motivation and char-
acter? Well, you have to be willing to give a candidate
tough questions, while trying to find inconsistencies in
their story.
74
at this:
75
Question Scoring criteria Answers and marks (1-5)
How did you hear about the Is the candidate selective in applying
position and what motivated for a job? Is the candidate motivated
you to apply? by this specific work?
76
Question Scoring criteria Answers and marks (1-5)
What are the first steps of your Do they invest time talking with the
design process and how much client or understanding the target
time do you spend on each? audience first?
Have you had a customer-facing job Do they show enthusiasm for this
before? If you had it, how did you type of work? How realistic are they
like it and what were its good and in their assessment of it?
bad sides?
Red flags: No relevant experience.
Unrealistic idea of the job.
Give me an example of the last two Do they show enjoyment for
times that you helped a friend, explaining technology? Do they just
family member, neighbor or jump in and solve the problem or
stranger with a technical issue? actually take the time to explain it
How did you approach the problem? to the user, so that they could solve
it themselves next time?
Second Opinions
Interviews don’t require the effort of just one person, they
require collective effort to get right. Generally, having
multiple people involved in the interview process is better
because:
78
be reviewed by another interviewer as the basis for
eliciting a second opinion.
80
more junior than desired, tell them this. If their technical
skills in Java disappointed you, inform them that they need
to improve. This way, there are no big surprises when it
comes to the negotiation and the salary you offer.
Wrapping It Up
Once you have a hire, before you can go out and celebrate,
you need to wrap everything up properly. Contact the other
candidates to inform them that you’ve hired someone else.
Explain the reasons why you preferred the other candidate,
so that they know how to improve in the future. Keep
81
bridges open—if you leave them with a good impression,
perhaps they’ll come to you later in their career. If they
were your second choice for the hire, tell them so. We
once had a situation where our primary pick turned out to
have motivational problems when working remotely. We
terminated his contract and contacted our second pick
who previously didn’t get the job because he had less
experience, but was otherwise great. He is now employed
by us and we’re very satisfied with his work.
82
CHAPTER 7: We’re Not Done
Yet: Continuous Improvement
How do you know if your Evidence-Based Hiring process is
any good? There is only one way to tell—look at the data.
83
Plan → Execute → Measure → Improve cycle.
Does screening of each résumé take too After 5-10 résumés Create a list of selection criteria or
much time? shorten your existing one.
Do applicants have the desired level of After 10-20 résumés If you want more experienced
experience? screened applicants, update the Job Ad to target
them:
- Raise salary level.
- Explicitly state that you’re looking for
experience.
- Update company branding to reflect
employee age diversity.
Are too many applicants applying? After 50-100 Revise the Job Ad to include more
applications requirements, and make sure that the
application form has questions that
eliminate candidates.
Are the questions ambiguous? Immediately after writing Send questions to, at least, one
questions other coworker for review.
84
Preparing questions checklist When to check? Corrective action
Are the answers to the questions available Immediately after writing Rephrase question using
online? questions suggestions from Chapter 5: Level
2: Understanding to make it less
googleable.
Do the majority of candidates answer the After 5-10 If everybody answers the question
question correctly? candidates tested correctly, it has no predictive value.
Make it harder, or replace with another
question.
Do all of the candidates answer the question After 10-30 Since you’re searching for a top-10%
incorrectly? candidates tested candidate, most applicants should get
your questions wrong. But, if 100% fail,
it’s likely the question, not them. Make it
easier, or replace with another question.
Do enough people answer the question within After 10-30 Fast solvers often need only half the
the allotted time? candidates tested allotted time. If most of the correct
answers as submitted close to the
maximum time, increase that maximum.
Do many, otherwise good, applicants give the After 10-30 There is a high probability that they are
same, incorrect answer? candidates tested right and you are wrong. Double-check
the correct answer.
Do interviews overrun their allotted time? After 2-3 candidates Ask a more experienced interviewer to
interviewed sit in and give feedback. An
interviewer should talk less than 30%
of the time, whilst still controlling the
flow of the conversation.
Do candidates have a wrong impression of After 3-5 candidates The Job Ad is probably misrepresenting
the job? interviewed the job. Ensure that every key task,
responsibility, perk, and expectation is
listed there.
Do negotiations with After 2-3 failed Be more transparent with candidates in the entire
potential hires often fail? negotiations process after every step, from the Job Ad to the
negotiation.
Do you regularly lose After 2-3 lost candidates Audit your process against similar companies. How
promising applicants to long are they taking to make an offer? Increase the
other companies during speed of your screening process.
the screening process?
Does a new hire fail to After 2-6 months of Check if the job requires a different set of skills than
perform on the job? employment what is tested in the screening process.
85
Post-hire checklist When to check? Corrective action
High turnaround of new After 6-18 months of Get feedback from people leaving the company. Use
hires? employment that knowledge to improve motivation questions
asked at interview.
86
CHAPTER 8: Conclusion
In my modest opinion, we don’t live in a world of equal
opportunities.
sb/images/multistatic/24/Image/1.jpg
63 Paper (Croatian): http://www.hde.hr/sadrzaj.aspx?Podrucje=171
87
side the room while the evaluator computer calculated and
printed the results. When I have finally read the printed
results, my name was not in the middle or at the bottom of
the list. It was at the top. The evaluator computer had de-
cided that I was the best young programmer in the country,
for that year.
_by_average_wage
88
get a Swiss job.
ecommuting-statistics
66 Upwork study: https://www.upwork.com/i/freelancing-in-america/20
16/
67 Article: https://www.class-central.com/report/mooc-stats-2017/
89
BONUS CHAPTER: Screening
Software Developers
Since my background is in programmer screening, what
follows is a detailed guide for this. Note that this guide is
not designed for a general audience because programming
differs from other jobs in multiple ways.
90
This effect has been observed for a long time. Imran Gorky
wrote in 2007:
...
public static double Average(int a, int b) {
return a + b / 2;
}
...
Can you find both bugs in the allotted time of five minutes?
We didn’t want it to be a trick question, so we also provided
reasons why the formula failed:
70 Blog post: https://imranontech.com/2007/01/24/using-fizzbuzz-to-
find-developers-who-grok-coding/
71 C2 wiki: http://wiki.c2.com/?FizzBuzzTest
91
These hints make it obvious that operator priority makes
the calculation wrong (primary school math), and that an
integer divided by another integer results in an integer
division (this is the same in C++, Java, or C#). But, even if
a candidate can’t find the bugs with the current code, they
could write their own. A good programmer should be able
to write an average method in less than 30 seconds.
92
The first test was short, and part of the candidate applica-
tion process. The key was to focus on just the two or three
most important job requirements that can be automatically
tested.
94
question asked to calculate where a grasshopper
would be after N steps.
Most testing platforms use the text input and output ap-
proach, but we opted to use unit tests. It’s easier for
candidates and allows us to test many more things. Be-
cause we have HTML/CSS unit tests running in headless
Chrome72 , we can ask such questions as the one in the
next figure.
96
In addition to simply testing if a program’s output is valid,
we can also test algorithmic complexity, performance, and
memory consumption.
notation/
74Wikipedia: https://en.wikipedia.org/wiki/Big_O_notation#Orders_of_c
ommon_functions
75 TestDome: https://www.testdome.com/questions/c-sharp/binary-
search-tree/12976
97
12 MB, and we know that the optimal solution takes 10
MB, then candidate solutions which take more than 20% of
optimal memory will fail.
t/benchmarksgame/how-programs-are-measured.html#source-code
98
than a shorter/more readable program which doesn’t work.
99
dates who applied, only to discover later that they had no
intention of doing the work. After passing the screening
process, they delegated the work to an inexperienced per-
son, while the expert would be busy applying for further
jobs to outsource. Having such an “interview expert” is
a common tactic for outsourcing agencies in developing
countries.
Conclusion
For HR departments the easiest way to hire is to have the
same process for every applicant, and to not have sepa-
rate tools for developers, students, salespeople, etc. But if
80Wikipedia: https://en.wikipedia.org/wiki/Proctor#Online_Proctoring
100
you screen developers like every other employee, you are
wasting your time and not getting the best talent. Program-
mer screening is quite advanced and there are 20 years of
experience in using better tools than a phone or a white-
board. Live coding, automated checking of correctness
and memory consumption, coding replay, code analysis,
and online proctoring are all available on various platforms
where you can even enter your own custom questions. And
let’s not forget that such testing is asynchronous, mean-
ing candidates can take it whenever works best for their
schedule and avoid traveling to your office.
101
One more thing. . .
Thanks for finishing the book!
102
About the Author
103