Avi Rappoport Search Tools Consulting www.searchtools.com consult1@searchtools.com UC Berkeley SIMS class 202 September 16, 2004
Information Architecture is vital
Usable sites have good navigation and structure
UCB SIMS 202, Sept. 2004 Avi Rappoport, Search Tools Consulting
Display results
Search Processing
content
search functionality
Index Issues
Stopwords
Stemming
Metadata
  Explicit (tags)
  Implicit (context)
Semantics
  CMS and Database fields
  XML tags and attributes
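To make the stopword and stemming issues concrete, here is a minimal sketch of how text might be normalized before it goes into an index. The stopword list, the suffix rules, and the function names `stem` and `index_terms` are all assumptions made for illustration; a production engine would more likely use a full stemmer such as Porter's and a tuned stopword list.

```python
import re

# Hypothetical stopword list and suffix rules, chosen only for illustration.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "is"}
SUFFIXES = ("ing", "es", "s")  # checked in this order


def stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def index_terms(text):
    """Lowercase, tokenize, drop stopwords, and stem what remains."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [stem(t) for t in tokens if t not in STOPWORDS]


print(index_terms("The indexing of the banks"))
```

The trade-off shows up quickly: dropping stopwords shrinks the index but makes a query made entirely of stopwords unmatchable, and crude suffix stripping merges "banks" with "bank" at the risk of merging unrelated words too.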
Retrieval = Matching
Single-word queries
Find items containing that word
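Single-word matching can be sketched with a small inverted index. This is an illustrative assumption about how a simple engine might do it, not a description of any particular product; the names `build_index` and `search` and the sample documents are hypothetical.

```python
from collections import defaultdict


def build_index(docs):
    """Map each lowercase word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search(index, word):
    """Return the sorted ids of documents containing the query word."""
    return sorted(index.get(word.lower(), set()))


# Hypothetical sample documents.
docs = {1: "river bank erosion", 2: "bank account fees", 3: "savings account"}
index = build_index(docs)
print(search(index, "bank"))
```

Matching only answers which items contain the word; it says nothing about which of them should be shown first.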
Relevance Ranking
Theory: sort the matching items, so the most relevant ones appear first
Can't really know what the user wants
Relevance is hard to define and situational
Short queries tend to be deeply ambiguous
  What do people mean when they type "bank"?
First 10 results are the most important
The more transparent, the better
Relevance Processing
Sorting documents on various criteria
Start with words matching query terms
Citation and link analysis
  Like old library Citation Indexes
  Ted Nelson - not only hypertext, but the links
  Google PageRank
    Incoming links
    Authority of linkers
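The two PageRank ingredients named above, incoming links and the authority of the linkers, can be illustrated with a simplified power-iteration sketch. This follows the general idea from the published PageRank description and is not Google's production ranking; the function name, the iteration count, and the tiny link graph are assumptions, and 0.85 is the commonly cited damping value.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links maps each page to the list of pages it links to."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with a small baseline share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its rank on, split across its outgoing links.
                share = damping * rank[page] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Page with no outgoing links: spread its rank evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical three-page site: a and b both link to c, c links back to a.
ranks = pagerank({"a": ["c"], "b": ["c"], "c": ["a"]})
print(sorted(ranks, key=ranks.get, reverse=True))
```

In this toy graph, c ranks highest because it has two incoming links, and a outranks b because its single incoming link comes from the most authoritative page.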
Other Algorithms
Vector space
Probabilistic (binary independence)
Fuzzy set theory
Bayesian statistical analysis
Latent semantic indexing
Neural networks
Machine learning
All require sophisticated queries
See MIR, chapter 2
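As one example from this list, the vector space approach can be sketched by weighting terms with tf-idf and ranking documents by cosine similarity to the query. This is a minimal sketch under simple assumptions (whitespace tokenization, weight = term count times log(N / document frequency)); the function names and sample documents are hypothetical, and real engines use more refined weighting.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return indexes of matching docs, most similar to the query first."""
    tokenized = [d.lower().split() for d in docs]
    n = len(docs)
    df = Counter(t for tokens in tokenized for t in set(tokens))

    def vector(tokens):
        # Terms that appear in no document are dropped.
        return {t: c * math.log(n / df[t]) for t, c in Counter(tokens).items() if df[t]}

    q = vector(query.lower().split())
    scores = [(cosine(q, vector(tokens)), i) for i, tokens in enumerate(tokenized)]
    return [i for score, i in sorted(scores, key=lambda s: (-s[0], s[1])) if score > 0]


# Hypothetical sample documents.
docs = ["river bank erosion", "bank account fees", "savings account interest"]
print(rank("bank account", docs))
```

The document containing both query terms comes out on top, while documents sharing only one term score lower.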
Relevance Heuristics
Heuristics are rules of thumb
Not algorithms, not math
Back to Simplicity
MSU Keywords
Siemens Results
Cooks.com Results
Salon.com Results
This presentation:
www.searchtools.com/slides/sims/202-04/