MIT Department of Electrical Engineering & Computer Science

E E C S

Querying Multiple Sources across the Internet

Luis Gravano
Stanford University

Wednesday, April 23, 1997
4:00 PM (3:45 refreshments)
Room NE43-518
EECS Special Seminar

Abstract

Information sources are available everywhere, both within the internal networks of organizations and on the Internet. To evaluate a query over many such sources and give the users the illusion of a single, large source, we need to perform three main tasks. First, we need to choose the best sources to evaluate a query. Second, we have to submit the query to these sources. Third, we need to merge the query results from the sources. I will first describe GlOSS, a scalable system that chooses the best document sources for a query. The GlOSS information about each source is orders of magnitude smaller than the source contents. I will also give an overview of the design of STARTS, an emerging protocol for Internet retrieval and search involving over 11 companies and organizations. Finally, I will address the problem of efficiently retrieving ranked information from sources. In particular, I will discuss the problem of merging ranked query results from sources with structured data, and optimization algorithms for queries over repositories of complex multimedia objects.


URL of this page: http://www-eecs.mit.edu/AY96-97/events/42.html
Created: Apr 14, 1997  | Modified: Jun 24, 1997
This announcement is from the MIT EECS 1996-97 archive.  | Current events
To MIT EECS home page  | Your comments and inquiries are welcome.