
Latent Topic Feedback for Information Retrieval

David Andrzejewski, David Buttler


Juan Gabriel Romero
Universidad Nacional de Colombia
May 31, 2013
Juan Gabriel Romero (Universidad Nacional de Colombia) Latent Topic Feedback for Information Retrieval May 31, 2013 1 / 14
The Problem
Corpus:
Document metadata limited
Specialized domain
Large corpus, small user base
The user cannot formulate the right query
Solution
Obtaining user feedback at the latent topic level
Learn latent (unobserved) topics
Construct representations of these topics
Present potentially relevant topics to the user
Augment the original query
Latent Dirichlet Allocation
Figure 1: Blei, D. Sep 2009. Topic Models
Latent Dirichlet Allocation
Figure 2: Blei, D. Sep 2009. Topic Models
Latent Dirichlet Allocation
P(\mathbf{w}, \mathbf{z}, \phi, \theta \mid \alpha, \beta, d) \propto \prod_t p(\phi_t \mid \beta) \prod_j p(\theta_j \mid \alpha) \prod_i \phi_{z_i}(w_i) \, \theta_{d_i}(z_i)

To infer z, \phi, and \theta, run Markov Chain Monte Carlo (Gibbs sampling) and estimate

\phi_t(w) \propto n_{tw} + \beta

\theta_j(t) \propto n_{jt} + \alpha
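The count-based point estimates above can be sketched directly from the Gibbs sampling count matrices (a minimal illustration with numpy; variable names follow the slide, not the paper's actual implementation):

```python
import numpy as np

def estimate_phi_theta(n_tw, n_jt, beta, alpha):
    """Point estimates of the topic-word (phi) and document-topic (theta)
    distributions from Gibbs sampling counts.

    n_tw : (T, W) array, count of word w assigned to topic t
    n_jt : (D, T) array, count of topic t assigned in document j
    """
    # Smooth counts with the Dirichlet hyperparameters, then normalize rows.
    phi = (n_tw + beta) / (n_tw + beta).sum(axis=1, keepdims=True)
    theta = (n_jt + alpha) / (n_jt + alpha).sum(axis=1, keepdims=True)
    return phi, theta
```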
Topic representation
First, with k = 10, take the top-k words of each topic:

W_t = k\text{-}\mathrm{argmax}_w \, \phi_t(w)
Label generation (best topic word)

Description        Score
Word probability   f_1(w) = P(w \mid z = t)
Topic posterior    f_2(w) = P(z = t \mid w)
PMI                f_3(w) = \sum_{w' \in W_t \setminus w} \mathrm{PMI}(w, w')
Conditional 1      f_4(w) = \sum_{w' \in W_t \setminus w} P(w \mid w')
Conditional 2      f_5(w) = \sum_{w' \in W_t \setminus w} P(w' \mid w)
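As a sketch of the first two scores, f_1 reads the topic-word probability directly and f_2 inverts it with Bayes' rule; assuming a uniform topic prior when none is given (an assumption of this illustration, not stated on the slide):

```python
import numpy as np

def label_scores(phi, t, w, p_z=None):
    """Word-probability and topic-posterior label scores for word w in topic t.

    phi : (T, W) topic-word probability matrix, rows sum to 1
    p_z : (T,) topic prior; uniform if not given (illustrative assumption)
    """
    T = phi.shape[0]
    if p_z is None:
        p_z = np.full(T, 1.0 / T)
    f1 = phi[t, w]                                # P(w | z = t)
    f2 = phi[t, w] * p_z[t] / (phi[:, w] @ p_z)   # P(z = t | w) via Bayes
    return f1, f2
```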
Topic representation
n-gram identification (Turbo Topics)
Most significant trigram
Two most significant bigrams
Four most significant unigrams
Capitalization (for display)
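Assuming candidate n-grams have already been scored for significance (the actual Turbo Topics test is not reproduced here), the selection and capitalization step above could be sketched as:

```python
def topic_label(ngram_scores):
    """Assemble a topic representation from significance-scored n-grams.

    ngram_scores : list of (ngram_tuple, score) pairs.
    Selection counts per n-gram length follow the slide:
    1 trigram, 2 bigrams, 4 unigrams, capitalized for display.
    """
    take = {3: 1, 2: 2, 1: 4}
    parts = []
    for n, count in take.items():
        ranked = sorted((g for g in ngram_scores if len(g[0]) == n),
                        key=lambda g: g[1], reverse=True)[:count]
        parts += [" ".join(w.capitalize() for w in g[0]) for g in ranked]
    return parts
```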
Topic selection
Top 2 ranked documents considered relevant (the set D_q)
Enriched topics:
E = \bigcup_{d \in D_q} k\text{-}\mathrm{argmax}_t \, \theta_d(t)
Related topics:
R = \bigcup_{t \in E} k\text{-}\mathrm{argmax}_{t' \notin E} \, \mathrm{sim}(t, t')
Filter topics by average pairwise PMI:
\mathrm{PMI}(t) = \frac{1}{k(k-1)} \sum_{w \neq w' \in W_t} \mathrm{PMI}(w, w')
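The enriched-topic set E can be sketched as a union of top-k topics over the pseudo-relevant documents (a minimal numpy illustration; names are this sketch's, not the paper's):

```python
import numpy as np

def enriched_topics(theta, relevant_docs, k):
    """Union of the top-k topics (by document-topic weight theta)
    over the pseudo-relevant documents D_q.

    theta : (D, T) document-topic matrix
    relevant_docs : indices of the top-ranked documents
    """
    E = set()
    for d in relevant_docs:
        # Indices of the k largest entries of theta[d].
        E.update(np.argsort(theta[d])[::-1][:k].tolist())
    return E
```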
Query expansion
Add the 10 most probable words of the topic, W_t, to the query,
with \lambda \in [0, 1] as weight parameter.
For N_q words in the original query, the weight of each original query word is
\frac{1 - \lambda}{N_q}
The weight for each word w from the selected topic is then
\lambda \, \hat{\phi}_t(w),
with \hat{\phi}_t representing the re-normalized topic-word probability:
\hat{\phi}_t(w) = \frac{\phi_t(w)}{\sum_{w' \in W_t} \phi_t(w')}
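The interpolation above guarantees the expanded term weights sum to one; a minimal sketch (function and variable names are illustrative):

```python
def expansion_weights(query_terms, phi_t, W_t, lam):
    """Interpolated term weights for the expanded query.

    Original query terms share (1 - lam) uniformly; each topic word
    w in W_t gets lam times the re-normalized topic probability.
    phi_t : dict mapping word -> topic-word probability phi_t(w)
    """
    Nq = len(query_terms)
    weights = {w: (1.0 - lam) / Nq for w in query_terms}
    norm = sum(phi_t[w] for w in W_t)  # re-normalization over W_t
    for w in W_t:
        # A topic word already in the query accumulates both weights.
        weights[w] = weights.get(w, 0.0) + lam * phi_t[w] / norm
    return weights
```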
Experiments
Questions:
Can query expansion with latent topic feedback improve the results of
actual queries?
Assuming there are latent topics, will the topic selection described
present them to the user?
If presented with a helpful topic will a user actually select it?
(Outside the scope)
Experimental setup
Data set from TREC
MALLET
Preparation: downcasing; removal of numbers and punctuation marks;
stop-word removal; filtering of rarely occurring words
Vocabularies between 10,000 and 20,000 words
Gibbs inference run for 1,000 iterations, re-estimating every 25 samples
500 topics
\lambda = 0.25
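The preparation steps above can be sketched as a small pipeline (MALLET performs this internally; the regex tokenizer and the min_count threshold here are illustrative assumptions):

```python
import re
from collections import Counter

def prepare(docs, stopwords, min_count=5):
    """Downcase, keep alphabetic tokens only (drops numbers and
    punctuation), remove stop words, and filter words occurring
    fewer than min_count times in the corpus."""
    token = re.compile(r"[a-z]+")
    tokenized = [token.findall(d.lower()) for d in docs]
    counts = Counter(w for doc in tokenized for w in doc)
    return [[w for w in doc if w not in stopwords and counts[w] >= min_count]
            for doc in tokenized]
```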
Results
Metrics: Mean Average Precision (MAP), Normalized Discounted Cumulative
Gain (NDCG), NDCG@15
Results
For 40% of queries there exists a latent topic that can enhance results
For 40% of these queries the approach finds relevant topics
Changes to the technique give worse results:
Without filtering: increase in the number of topics presented without a
substantial increase in helpful topics retrieved
Excluding related topics: decrease in both the number of topics and the
helpful topics presented
