Searching: Kruse and Ryba CH 7.1-7.3 and 9.6

Serial search has average case time complexity of O(n) as it may need to examine all records to find a match. Binary search works on sorted data and has average and worst case time complexity of O(logn) as it divides the search space in half each step. Hash tables provide the best performance of O(1) on average by using a hash function to directly map a key to an array index, avoiding the need for comparison of keys.

Uploaded by

Evans Red

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

56 views

Searching: Kruse and Ryba CH 7.1-7.3 and 9.6

Uploaded by

Evans Red

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 64

Searching

Kruse and Ryba

Ch 7.1-7.3 and 9.6
Problem: Search
We are given a list of records.
Each record has an associated key.
Give efficient algorithm for searching for a
record containing a particular key.
Efficiency is quantified in terms of average
time analysis (number of comparisons) to
retrieve an item.
Search
[0] [1] [2] [3] [4] [ 700 ]

Number 281942902 Number 233667136 Number 580625685
Number 701466868 Number 506643548 Number 155778322

Each record in list has an associated key. Number 580625685

In this example, the keys are ID numbers.

Given a particular key, how can we

efficiently retrieve the record from the list?
Serial Search
Step through array of records, one at a time.
Look for record with matching key.
Search stops when
record with matching key is found
or when search has examined all records
without success.
Pseudocode for Serial Search
// Search for a desired item in the n array elements
// starting at a[first].
// Returns pointer to desired record if found.
// Otherwise, return NULL

for(i = first; i < n; ++i )

if(a[first+i] is desired item)
return &a[first+i];

// if we drop through loop, then desired item was not found

return NULL;
Serial Search Analysis
What are the worst and average case
running times for serial search?
We must determine the O-notation for the
number of operations required in search.
Number of operations depends on n, the
number of entries in the list.
Worst Case Time for Serial Search
For an array of n elements, the worst case time
for serial search requires n array accesses: O(n).
Consider cases where we must loop over all n
records:
desired record appears in the last position of
the array
desired record does not appear in the array at
all
Average Case for Serial Search
Assumptions:
1. All keys are equally likely in a search
2. We always search for a key that is in the array
Example:
We have an array of 10 records.
If search for the first record, then it requires 1 array
access; if the second, then 2 array accesses. etc.
The average of all these searches is:
(1+2+3+4+5+6+7+8+9+10)/10 = 5.5
Average Case Time for Serial Search

Generalize for array size n.

Expression for average-case running time:

(1+2++n)/n = n(n+1)/2n = (n+1)/2

Therefore, average case time complexity for serial

search is O(n).
Binary Search
Perhaps we can do better than O(n) in the
average case?
Assume that we are give an array of records
that is sorted. For instance:
an array of records with integer keys sorted
from smallest to largest (e.g., ID numbers), or
an array of records with string keys sorted in
alphabetical order (e.g., names).
Binary Search Pseudocode

if(size == 0)
found = false;
else {
middle = index of approximate midpoint of array segment;
if(target == a[middle])
target has been found!
else if(target < a[middle])
search for target in area before midpoint;
else
search for target in area after midpoint;
}