Current projects
What's New
Research Interests
The primary goal of my research is to make it truly easy for people to share, access and understand large-scale digital information. In particular, my current research focuses on the study of the evolution, management, retrieval and mining of information on the World-Wide Web. Here is the list of my current and past research projects.
Recent projects
Search engines are a big part of our everyday life. Most of us rely on search engines to discover and access contents from the Web. Does it mean that we are biased by what search engines process and present? What kind of and how much bias can search engines introduce? The primary goal of this research project is to investigate the problem of potential search-engine bias and come up with technical solutions to the problem.
The primary goal of this project is to develop enabling technologies to archive the history of the Web, so that our future generations are able to access the Web of today. As a testbed, we are currently downloading and archiving the blog history data and we are investigating related technical issues.
Blogs (or personal weblogs) are a fascinating new media where individuals publish and share their own opinions on various social, technical and personal issues. The primary goal of this research is to develop novel technologies to build a central portal for blogs where users can easily access, mine and retrieve new high-quality content on blogs.
Keyword-based search is a hugely successful and effective way to access textual information on the Web. We contend, however, that it is not the only way. The primary goal of this project is to develop new ways to access and exploit large-scale textual information, other than well-known keyword-based search. This is a new project that we just started.
The Web is a constantly evolving dynamic media. The primary goal of this research is to examine the change and evolution of the Web. The questions we try to answer include "how much new content is introduced to the Web every day?", "how often do Web pages change?", "how can we model the changes of Web pages?", etc.
A Web crawler is a program that retrieves and stores pages from the Web, commonly for a Web search engine, and its effectiveness has a significant impact on the quality of search results. The primary goal of this project is to investigate a number of technical issues to make a scalable, high-quality Web crawler.
Recent Professional Activities
Awards
Teaching
The following list shows the classes that I have been teaching at UCLA.