Google is not enough




                Fall 2009, Class 2
                Prof. Scott Moore




          http://www.howcanifindit.com/
Finding information
 can be frustrating.
You search
in your job
search, for
classes, for
   personal
   reasons.
Lots more information available now.
And lots more
technology
(hardware and
software), too.
Knowledge is
now the ability
to learn and the
ability to find
Tools are evolving for specialized needs.
Multiple tools
are needed to
meet your
diverse and
changing needs
The changing context
         of search is
          driving the
            need for
          new tools.
Changing operating context

         Then   Now
Changing operating context

         Then    Now

       Experts   You & me
Changing operating context

                Then    Now

             Experts    You & me

 Well-defined queries   Ill-defined
Changing operating context

                  Then    Now

               Experts    You & me

   Well-defined queries   Ill-defined

Thousands of documents    Billions of documents
Understanding
search engines
will help you be a
better searcher.
Search tools can have several functions

                           Search tools




s




s
Search tools can have several functions

                                  Search tools
              generate



      Query results
      & structure
s




s
                         Document set
Search tools can have several functions

                                    Search tools
              generate
                          explore

      Query results
      & structure
s




s
                         Document set
Search tools can have several functions

                                      Search tools
              generate
                          explore

      Query results         monitor
                            changes
      & structure
s




s
                         Document set
Categorizing search engines
Categorizing search engines
                 Query terms
Categorizing search engines
                 Query terms
                and
Categorizing search engines
                 Query terms
                and



                          or
Categorizing search engines
                               Query terms
                              and



                                        or




              Search target
       addresses    images
   scholarly                 blogs
             maps news
HTML                      PDF
      financial video            books
Categorizing search engines
 Indexed information                Query terms
                                   and



                                             or



    Documents
                   Search target
       addresses    images
   scholarly                 blogs
             maps news
HTML                      PDF
      financial video            books
Categorizing search engines
      Indexed information                Query terms
                                        and
  (Full text)
Search engine
                                                  or
    Text


           Documents
                        Search target
            addresses    images
        scholarly                 blogs
                  maps news
     HTML                      PDF
           financial video            books
Categorizing search engines
      Indexed information                     Query terms
                                             and
  (Full text)
                       Directory
Search engine
                                                       or
    Text               Meta info


           Documents
                             Search target
            addresses    images
        scholarly                 blogs
                  maps news
     HTML                      PDF
           financial video            books
*
                    +
                              ~
                         -
Different search        not
                                    “”
engines support                   intitle:
different special        link:
search terms &                       inurl:
operators.              site:
Evaluating a search engine’s performance
Evaluating a search engine’s performance



     Relevant
Evaluating a search engine’s performance



                           Retrieved
Evaluating a search engine’s performance



     Relevant              Retrieved
Evaluating a search engine’s performance



     Relevant               Retrieved

                    B   C
                A
Evaluating a search engine’s performance



     Relevant                   Retrieved

                        B   C
                    A




      Retrieved &
       relevant
Evaluating a search engine’s performance



            Relevant                   Retrieved

                               B   C
                           A




Not retrieved but
    relevant

             Retrieved &
              relevant
Evaluating a search engine’s performance



            Relevant                           Retrieved

                                  B        C
                           A




Not retrieved but
    relevant

             Retrieved &   Retrieved but
              relevant      not relevant
Evaluating a search engine’s performance



            Relevant                            Retrieved

                                  B        C
                           A



                                                Recall =     B
                                                            A+B
Not retrieved but
    relevant

             Retrieved &   Retrieved but       Precision =    B
                                                             B+C
              relevant      not relevant
Evaluating a search engine’s performance



            Relevant                            Retrieved

                                  B        C
                           A



                                                Recall =     B
                                                            A+B
Not retrieved but
    relevant

             Retrieved &   Retrieved but       Precision =    B
                                                             B+C
              relevant      not relevant
We can
understand search
engines even
better if we think
about the process
of searching.
Deconstructing the search experience



                                    Search
              Query
                                    Engine

                        Results




     Subset           Searchable
     of Web           information
Deconstructing the search experience

    Variety & usefulness
      of special queries

Automation                               Search
                   Query
                                         Engine

                             Results




         Subset            Searchable
         of Web            information
Deconstructing the search experience

       Variety & usefulness
         of special queries

 Automation                                     Search
                          Query
                                                Engine
     Content (pages,
categories, paid links)             Results
     Format of results
        Delivery form

             Subset               Searchable
             of Web               information
Deconstructing the search experience

       Variety & usefulness
         of special queries

 Automation                                     Search
                          Query
                                                Engine
     Content (pages,
categories, paid links)             Results
     Format of results
        Delivery form

  Target      Subset              Searchable
              of Web              information

 Quality of
                 Opacity
 coverage
Deconstructing the search experience

       Variety & usefulness
         of special queries

 Automation                                                    Search
                          Query
                                                               Engine
     Content (pages,
categories, paid links)             Results
     Format of results
        Delivery form

  Target      Subset              Searchable
              of Web              information

 Quality of                                   How frequently
                 Opacity
 coverage                                       updated?
Deconstructing the search experience

       Variety & usefulness                                    Quality of
         of special queries                                    the experience

 Automation                                                     Search
                          Query
                                                                Engine
     Content (pages,
categories, paid links)             Results                    Responsiveness
     Format of results
        Delivery form

  Target      Subset              Searchable
              of Web              information

 Quality of                                   How frequently
                 Opacity
 coverage                                       updated?
Considering: Google

Indexed information Query terms default   Search target


   Special search
                       Automation            Content
 terms & operators

                                          How frequently
Quality of coverage      Opacity
                                            updated?

                      Quality of the
  Responsiveness
                       experience
Will students learn that just Googling
something isn’t always the right thing to do?
Busy students might spend
           too much time
            unsuccessfully
             searching for
              information.
Multiple tools
are needed to
meet your
diverse and
changing needs
Students learn how to
      effectively and
       efficiently find
           information
              using the
             right tool.
Start working through the
              exercises to
                 begin to
              learn about
                  search.

02 Web Search