added by Benjamin Searle · updated 2y ago
Rethinking Search: Making Domain Experts out of Dilettantes
- This represents a fundamentally different way of thinking about IR systems. Within the index-retrieve-then-rank paradigm, modeling work (e.g., query understanding, document understanding, retrieval, ranking, etc.) is done on top of the index itself. This results in modern IR systems being comprised of a disparate mix of heterogeneous models (e.g., ...
from Rethinking Search: Making Domain Experts out of Dilettantes by Donald Metzler
Benjamin Searle added 2y ago
- - Leveraging Document and Corpus Structure
  - Scaling to Multiple Languages
- [...] Today’s cutting edge IR systems are not fundamentally different than classical IR systems developed many decades ago. Indeed, a majority of today’s systems boil down to: (a) building an efficient queryable index for each document in the corpus, (b) retrieving a set of candidates for a given query, and (c) computing a relevance score for each ...
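The three steps named in the highlight above (index, retrieve, rank) can be illustrated with a toy sketch. This is not code from the paper: the function names, the tiny corpus, and the use of raw term frequency as the relevance score are all assumptions chosen to keep the example short. Production systems use far richer indexes and scoring models.

```python
from collections import Counter, defaultdict


def build_index(corpus):
    """(a) Build an inverted index: term -> {doc_id: term count}."""
    index = defaultdict(dict)
    for doc_id, text in corpus.items():
        for term, count in Counter(text.lower().split()).items():
            index[term][doc_id] = count
    return index


def retrieve(index, query):
    """(b) Candidate set: every document containing at least one query term."""
    candidates = set()
    for term in query.lower().split():
        candidates.update(index.get(term, {}))
    return candidates


def rank(index, query, candidates):
    """(c) Score each candidate by summed query-term frequency (an assumed,
    deliberately simple relevance score), highest score first."""
    terms = query.lower().split()
    scores = {
        doc_id: sum(index.get(term, {}).get(doc_id, 0) for term in terms)
        for doc_id in candidates
    }
    # Sort by descending score, breaking ties by document id.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical three-document corpus, for illustration only.
corpus = {
    "d1": "neural ranking models for search",
    "d2": "classical search with an inverted index",
    "d3": "cooking with cast iron",
}
index = build_index(corpus)
query = "inverted index search"
results = rank(index, query, retrieve(index, query))
print(results)
```

The output is still a ranked list that the user has to sift through, which is the cognitive burden the later highlights describe.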
- We envision using the same corpus model as a multi-task learner for multiple IR tasks. To this end, once a corpus model has been trained, it can of course be used for the most classical of all IR tasks – document retrieval. However, by leveraging recent advances in multi-task learning, such a model can very likely be applied to a diverse range of t...
- Pre-trained language models (LM), by contrast, are capable of directly generating prose that may be responsive to an information need, but at present they are *dilettantes* rather than domain experts – they do not have a true understanding of the world, they are prone to hallucinating, and crucially they are incapable of justifying their utterances...
- To replace indexes with a single, consolidated model, it must be possible for the model itself to have knowledge about the universe of document identifiers, in the same way that traditional indexes do. One way to accomplish this is to move away from traditional LMs and towards corpus models that jointly model term-term, term-document, and document-...
- The very fact that ranking is a critical component of this paradigm is a symptom of the retrieval system providing users a selection of potential answers, which induces a rather significant cognitive burden on the user. The desire to return answers instead of ranked lists of results was one of the motivating factors for developing question answerin...
- If all of these research ambitions were to come to fruition, the resulting system would be a very early version of the system that we envisioned in the introduction. That is, the resulting system would be able to provide domain expert answers to a wide range of information needs in a way that neither modern IR systems, question answering systems, o...