Identification of Interests, Trends and Dynamics in Document Networks

Contents


  1. Research Objective
  2. Publications, Reports, and Tools

Research Objective

The prime example of a Document Network (DN) is the World Wide Web (WWW). But many other types of such networks exist: bibliographic databases containing scientific publications, preprints, internal reports , as well as databases of datasets used in scientific endeavors. Each of these databases possesses several distinct relationships among documents and between documents and semantic tags or indices that classify documents appropriately. For instance, documents in the WWW are related via a hyperlink network, while documents in bibliographic databases are related by citation and collaboration networks. Furthermore, documents can be related to semantic tags such as keywords used to describe their content. Given these relations, we can compute distance functions amongst documents and/or semantic tags, thus creating associative networks between these items, which identify stronger or weaker co-associations. The figure below represents an associative network of people extracted from co-occurrence in documents in a database as described in an internal report. You can also see a 3D Video (Real Video) of this network.

Click for larger image

This project is investigating the hypothesis that the metric behavior of the distance functions defining these associative networks, can be used as an indicator of the relevance of collections of documents, the interests of users who have selected certain sets of documents, the trends in communities associated with sets of documents, as well the dynamics of such networks in general.


Publications, Reports, and Tools


The semi-metric methodology is now used by the givealink.org project. L. Stoilova, T. Holloway, B. Markines, A. Maguitman, F. Menczer [2006]: "GiveALink: Mining a Semantic Network of Bookmarks for Web Search and Recommendation". Proc. KDD Workshop on Link Discovery: Issues, Approaches and Applications.

Rocha, L.M., T. Simas, A. Rechtsteiner, M. DiGiacomo, R. Luce [2005]. "MyLibrary@LANL: Proximity and Semi-metric Networks for a Collaborative and Recommender Web Service". In: Proc. 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), IEEE Press. IEEE Press, pp. 565-571.

        Rocha, Luis M. [2002]. "Semi-metric Behavior in Document Networks and its Application to Recommendation Systems". In: Soft Computing Agents: A New Perspective for Dynamic Information Systems. V. Loia (Ed.) International Series Frontiers in Artificial Intelligence and Applications. IOS Press, pp. 137-163.

        Rocha, Luis M. [2002]. "Combination of Evidence in Recommendation Systems Characterized by Distance Functions". In: Proceedings of the 2002 World Congress on Computational Intelligence: FUZZ-IEEE'02. Honolulu, Hawaii, May 2002. IEEE Press, pp. 203-208. LAUR 02-154.

        Rocha Luis M. [2003]. "Extraction and Semi-metric Analysis of Social and Biological Networks". Poster at Networks: Structure, Dynamics and Function, May 12 - 16, 2003, Santa Fe, New Mexico, USA.

        Rocha, Luis M. [2002]. Proximity and Semi-Metric Analysis of Social Networks (pdf). Advanced Knowledge Integration In Assessing Terrorist Threats LDRD-DR. Los Alamos National Laboratory Internal Report. LAUR 02-6557

        Rocha, Luis M. [2001]. "Identification of Interests, Trends and Dynamics in Document Networks". Los Alamos National Laboratory LDRD ER Research Proposal. Awarded for FY02-04


For more information contact Luis Rocha at rocha@indiana.edu. Check the Web Design Credits, for due credit.
Last Modified: December 7, 2006