Our purpose in this paper is to review and compare two approaches to hyperlink analysis, and thereby to contribute to methodological discussions in Internet studies. The approaches and goals of these two areas have some features in common, but also important differences that it will be useful to identify and explain. Both areas have the broad goal of extracting useful information about Web use and employ the general approach of using predominantly quantitative techniques to display or summarize hyperlink-based data. This is in contrast to hyperlink analysis as conducted by statistical physicists and computer scientists, which adopts a much more abstract approach, as will be revealed in the brief review of these areas below.
The two approaches come from different fields. Hyperlink Network Analysis (HNA) derives from Social Network Analysis, whereas Webometrics derives from information science. As a result it would be natural to expect the different backgrounds to be reflected in the kinds of problems tackled, the methods developed and the outcomes sought. Field differences can also mean that there is little interaction between practitioners of each approach, and our review aims to enrich the understanding of these approaches by highlighting what they share and how they differ.
A second motivation for this review is to introduce the techniques to a wider group of researchers, in the belief that with the increasing importance of the Web for an ever-broader spectrum of human activities, online analysis techniques should be more widely developed and exploited. Hyperlink analysis provides Internet researchers with new analytical methods for the study of networked (or connected) structures on the World Wide Web. This paper both provides a review of hyperlink research that originates in hyperlink network analysis or Webometrics communities and examines practical issues related to link data-collection techniques.
Compared to other Web methods such as content-based analysis, the relative advantage of hyperlink analysis is that it can examine how Web sites form particular kinds of relations with one another via hyperlinks. According to Weare and Lin (2000), content-based studies may be missing an opportunity if they fail to consider the hyperlinked environment. Many Web sites with common topics are hyperlinked together, which allows users to access materials or services hosted on other sites. Given this interwoven hyperlink structure, it may be necessary to recognize individual Web sites as mutually dependent entities that together constitute a Web system. If a content analysis of an individual Web site does not include the materials to which that site hyperlinks (such as academic reports hosted on other sites), it fails to see the structures in the environment that afford social navigation. Hyperlink analysis also enables the visualization of navigational elements, such as changes in the hyperlink structure of a Web site's contents, and it makes possible the quantification of relational attributes among sites within the community being studied. Using this information in combination with other Web analyses (Howard, 2002; Park, 2002c) can contribute to an understanding of why and how certain types of content come to appear on Web sites.
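As a minimal illustration of what quantifying relational attributes among sites can mean in practice, the following Python sketch counts inlinks and outlinks per site and computes the density of a small directed hyperlink network. The site names and links are invented for illustration, and the helper names (link_counts, density) are hypothetical; this is a sketch under those assumptions, not a procedure taken from the studies reviewed here.

```python
from collections import Counter

# Hypothetical directed hyperlinks (source site, target site), invented for illustration.
links = [
    ("site-a.example", "site-b.example"),
    ("site-a.example", "site-c.example"),
    ("site-b.example", "site-c.example"),
    ("site-c.example", "site-a.example"),
]


def link_counts(edges):
    """Return (outlinks, inlinks) counters for a list of directed site-to-site links."""
    outlinks = Counter(src for src, _ in edges)
    inlinks = Counter(dst for _, dst in edges)
    return outlinks, inlinks


def density(edges):
    """Share of possible ordered site pairs that are connected by at least one link."""
    sites = {site for edge in edges for site in edge}
    possible = len(sites) * (len(sites) - 1)
    # Ignore self-links and duplicate links between the same ordered pair.
    unique = {(src, dst) for src, dst in edges if src != dst}
    return len(unique) / possible if possible else 0.0


outlinks, inlinks = link_counts(links)
for site in sorted(set(outlinks) | set(inlinks)):
    print(site, "out:", outlinks[site], "in:", inlinks[site])
print("density:", round(density(links), 3))
```

Inlink counts of this kind are the raw material for the link-based indicators discussed in the Webometrics literature, while density is one of the simplest whole-network measures used in network analysis; a real study would first have to decide how to collect the links and what counts as a "site".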
References
Adamic, L. A. (1999). The small world Web. Proceedings of 3rd European Conference of Research and Advanced Technology for Digital Libraries, ECDL. Retrieved October 3, 2002 from http://www.hpl.hp.com/shl/papers/smallworld/smallworldpaper.html.
Adamic, L. A., & Adar, E. (2001, May). You are what you link. Paper presented to the 10th annual International World Wide Web Conference, Hong Kong. Retrieved June 19, 2001 from http://www10.org/program/society/yawyl/YouAreWhatYouLink.htm.
Aguillo, I. F. (1998). STM information on the Web and the development of new Internet R&D databases and indicators, in Online Information 98: Proceedings. Learned Information, 239-243.
Albert, R., Jeong, H., & Barabási, A.-L. (1999). Diameter of the World Wide Web. Nature, 401(9), 130-131.
Aldenderfer, M. S., & Blashfield, R. K. (1984). Cluster analysis. Beverly Hills, CA: Sage.
Almind, T. C. & Ingwersen, P. (1997). Informetric analyses on methodological approaches to 'Webometrics.' Journal of Documentation, 53(4), 404-426.
Arasu, A., Cho, J., Garcia-Molina, H., Paepcke, A. & Raghavan, S. (2001). Searching the Web. ACM Transactions on Internet Technology, 1(1), 2-43.
Bae, S., & Choi, J. H. (2000, April). Cyberlinks between human rights NGOs: A network analysis. Paper presented to the 58th annual national meeting of the Midwest Political Science Association, Chicago.
Baeza-Yates, R. & Castillo, C. (2001). Relating Web characteristics with link based Web page ranking. Proceedings of SPIRE 2001, IEEE (pp. 21-32). CS Press, Laguna San Rafael, Chile.
Bar-Ilan, J. (1999). Search engine results over time - A case study on search engine stability. Cybermetrics, 2/3. Retrieved January 3, 2003 from http://www.cindoc.csic.es/cybermetrics/articles/v2i1p1.html. Available http://dois.mimas.ac.uk/DoIS/data/Articles/upvupvcyby:1998-99:v:2-3:i:1:p:1.html.
Bar-Ilan, J. (2001). Data collection methods on the Web for informetric purposes - A review and analysis. Scientometrics, 50(1), 7-32.
Barnett, G. A. (1993). Correspondence analysis: A method for the description of communication networks. In Barnett, G., & Richards, W. (Eds.). Progress in communication sciences (pp. 135-164). Norwood, N.J.: Ablex.
Barnett, G. A. (2001). A longitudinal analysis of the international telecommunication network, 1978-1996. American Behavioral Scientist, 44(10), 1638-1655.
Barnett, G. A., Chon, B., Park, H., & Rosen, D. (2001, May). An examination of international Internet flows: An autopoietic model. Paper presented at the annual conference of International Communication Association, Washington, D.C.
Beaulieu, A., & Simakova, (2002). The public face of databases: data resources on the web and the creation of trust in science. Paper presented at the Society for the Social Studies of Science (4S) 2002 Annual Meeting. Milwaukee, USA.
Birnie, S. A., & Horvath, P. (2002). Psychological predictors of Internet social communication. Journal of Computer-Mediated Communication, 7(4). Retrieved September 24, 2002 from http://www.ascusc.org/jcmc/vol7/issue4/horvath.html.
Bonacich, P., & Lloyd, P. (2001). Eigenvector-like measures of centrality for asymmetric relations. Social Networks, 23, 191-201.
Björneborn, L., & Ingwersen, P. (2001). Perspectives of Webometrics. Scientometrics, 50(1), 65-82.
Björneborn, L. (2001). Small-world linkage and co-linkage. Proceedings of the 12th ACM Conference on Hypertext and Hypermedia (pp. 133-134). New York: ACM Press.
Borgman, C., & Furner, J. (2002). Scholarly communication and bibliometrics. In Cronin, B. (Ed.), Annual Review of Information Science and Technology 36 (pp. 3-72). Medford, NJ: Information Today Inc.
Brin, S. & Page, L. (1998). The anatomy of a large scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7), 107-117.
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A. & Wiener, J. (2000). Graph structure in the Web. Journal of Computer Networks, 33(1-6), 309-320.
Brunn, S. D., & Dodge, M. (2001). Mapping the 'Worlds' of the world wide web: (Re)Structuring global commerce through hyperlinks. American Behavioral Scientist, 44(10), 1717-1739.
Burt, R. S. (1992). Structural holes: The social structure of competition. Cambridge, MA: Harvard University Press.
Chu, H., He, S. & Thelwall, M. (2002). Library and information science schools in Canada and USA: A Webometric perspective. Journal of Education for Library and Information Science, 43(2), 110-125.
Ciolek, T. M. (2001, September). Networked information flows in Asia: The research uses of the AltaVista search engine and "weblinksurvey" software. Paper presented to the Internet Political Economy Forum 2001: Internet and Development in Asia, Singapore. Retrieved February 8, 2002 from http://www.ciolek.com/PAPERS/weblinksurvey2001.html.
Cronin, B. (1984). The citation process. London: Taylor Graham.
Cronin, B. (2001). Bibliometrics and beyond: Some thoughts on Web-based citation analysis. Journal of Information Science, 27(1), 1-7.
Cronin, B., Snyder, H.W., Rosenbaum, H., Martinson, A. & Callahan, E. (1998). Invoked on the Web. Journal of the American Society for Information Science, 49(14), 1319-1328.
Cui, L. (1999). Rating health Web sites using the principles of citation analysis: A bibliometric approach. Journal of Medical Internet Research, 1(1), e4. Retrieved January 3, 2003 from http://www.jmir.org/1999/1/e4/index.htm.
Danowski, J., & Edison-Swift, P. (1985). Crisis effects on intraorganizational computer-based communication. Communication Research, 12(2), 251-270.
Darmoni, S. J., Thirion, B., Douyère, M., Challoub, C., & Leroy, J. P. (2000). Mesure de l'impact des sites Web : Le Web impact factor. L'exemple des CHU français. Revue du Praticien - Médecine Générale, 14(516), 2079-2080.
Davenport, E. & Cronin, B. (2000). The citation network as a prototype for representing trust in virtual environments. In: Cronin, B. & Atkins, H. B. (Eds.). The Web of knowledge: a festschrift in honor of Eugene Garfield. Medford, NJ: Information Today Inc. ASIS Monograph Series, 517-534.
Douyère, M., Soualmia, L.F., Le Duff, F., Thelwall, M. & Darmoni, S.J. (2002, May). Web impact factor : Un outil bibliométrique appliqué aux sites Web des facultés de médecine et des CHU français, Neuvièmes Journées Francophones d'Informatique Médicale. Québec, Canada.
Flake, G.W., Lawrence, S., Giles, C.L., & Coetzee, F.M. (2002). Self-organization and identification of Web communities, IEEE Computer, 35, 66-71.
Freeman, L. C. (1979). Centrality in social networks: Conceptual clarification. Social Networks, 1, 215-239.
Garfield, E. (1979). Citation indexing: Its theory and applications in science, technology and the humanities. New York: Wiley Interscience.
Garton, L., Haythornthwaite, C., & Wellman, B. (1997). Studying online social networks. Journal of Computer-Mediated Communication, 3(1). Retrieved September 19, 2000 from http://www.ascusc.org/jcmc/vol3/issue1/garton.html.
Gay, G., Stefanone, M., Grace-Martin, M., & Hembrooke, H. (2001). The effects of wireless computing in collaborative learning environments. International Journal of Human-Computer Interaction, 13(2), 257-276.
Goodrum, A. A., McCain, K. W., Lawrence, S. & Giles, C. L. (2001). Scholarly publishing in the Internet age: A citation analysis of computer science literature. Information Processing & Management, 37(5), 661-676.
Halavais, A. (2000). National borders on the World Wide Web. New Media & Society, 2(1), 7-28.
Halavais, A., & Garrido, M. (2003). Mapping networks of support for the Zapatista movement. In McCaughey, M., & Ayers, M. D. (Eds.), Cyberactivism: Online activism in theory and practice. London: Routledge.
Hampton, K. N., & Wellman, B. (2000). Examining community in the digital neighborhood: Early results from Canada's wired suburb. In Ishida, T. & Isbister, K. (Eds.), Digital cities: Technologies, experiences, and future perspectives (pp. 194-208). Heidelberg, Germany: Springer-Verlag.
Hampton, K. N. (1999). Computer assisted interviewing: The design and application of survey software to the wired suburb project. Bulletin de Methode Sociologique (BMS), 62, 49-68.
Hargittai, E. (1999). Weaving the western Web: Explaining differences in Internet connectivity among OECD countries. Telecommunications Policy, 23(10/11), 701-718.
Harter, S. & Ford, C. (2000). Web-based analysis of E-journal impact: Approaches, problems, and issues. Journal of the American Society for Information Science, 51(13), 1159-1176.
Haythornthwaite, C. (2000). Online personal networks: Size, composition and media use among distance learners. New Media & Society, 2(2), 195-226.
Haythornthwaite, C., & Wellman, B. (1998). Work, friendship and media use for information exchange in a networked organization. Journal of the American Society for Information Science, 49(12), 1101-1114.
Henzinger, M. R. (2001). Hyperlink analysis for the web. IEEE Internet Computing, 5(1), 45-50.
Hernandez-Borges, A., Macias-Cervi, P., & Gaspar Guadardo, M. (1999). Can examination of WWW usage statistics and other indirect quality indicators help to distinguish the relative quality of medical Web sites? Journal of Medical Internet Research, 1(1), e1. Retrieved January 3, 2003 from http://www.jmir.org/1999/1/e1/index.htm.
Howard, P. (2002). Network ethnography and the hypermedia organization: New organizations, new media, new methods. New Media & Society, 4(4), 551-575.
Ingwersen, P. (1998). The calculation of Web impact factors. Journal of Documentation, 54(2), 236-243.
Jackson, M. H. (1997). Assessing the structure of communication on the world wide web. Journal of Computer-Mediated Communication, 3(1). Retrieved September 19, 2000 from http://www.ascusc.org/jcmc/vol3/issue1/jackson.html.
Jones, S. (Ed.) Doing Internet research: Critical issues and methods for examining the Net. Thousand Oaks, CA: Sage.
Kang, N., & Choi, J. H. (1999). Structural implications of the crossposting network of international news in cyberspace. Communication Research, 26(4), 454-481.
Kim, H. J. (2000). Motivations for hyperlinking in scholarly electronic articles: A qualitative study. Journal of the American Society for Information Science, 51(10), 887-899.
Kleinberg, J. (1999). Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5), 604-632.
Kling, R. & McKim, G. (2000). Not just a matter of time: Field differences in the shaping of electronic media in supporting scientific communication. Journal of the American Society for Information Science, 51(14), 1306-1320.
Kling, R. (2000). Learning about information technologies and social change: The contribution of social informatics. The Information Society, 16(3), 217-232.
Kling, R., McKim, G., & King, A. (2001). A bit more to IT: Scholarly communication forums as socio-technical interaction networks. Unpublished manuscript retrieved August 14, 2002 from http://www.slis.indiana.edu/csi/Wp/wp01-02B.html.
Koku, E., Nazer, N., & Wellman, B. (2001). Netting scholars: Online and offline. American Behavioral Scientist, 43 (Special issue: Mapping globalization), 1750-1772.
Krackhardt, D., & Porter, L. (1986). The snowball effect: Turnover embedded in communication networks. Journal of Applied Psychology, 71, 50-55.
Krebs, V. (2000). Working in the connected world book network. IHRIM (International Association for Human Resource Information Management) Journal, 4(1), 87-90.
Kumar, R., Raghavan, P., Rajagopalan, S., & Tomkins, A. (1999, April). Trawling the Web for cyber communities. Paper presented to the 8th World Wide Web Conference, Toronto, Canada.
Larson, R. R. (1996). Bibliometrics of the World Wide Web: An exploratory analysis of the intellectual structure of cyberspace. Proceedings of the ASIS 59th annual meeting.
Lawrence, S. & Giles, C. L. (1999). Accessibility of information on the Web. Nature, 400, 107-109.
Leydesdorff, L. & Curran, M. (2000). Mapping university-industry-government relations on the Internet: The construction of indicators for a knowledge-based economy, Cybermetrics, 4. Retrieved January 3, 2003 from http://www.cindoc.csic.es/cybermetrics/articles/v4i1p2.html.
Li, X., Thelwall, M., Musgrove, P., & Wilkinson, D. (2002, September). The relationship between the links/Web impact factors of computer science departments in UK and their RAE (Research Assessment Exercise) ranking in 2001. Paper presented to the Seventh International S&T Indicators Conference, Karlsruhe, Germany.
Mann, C., & Stewart, F. (2000). Internet communication and qualitative research: A handbook for researching online. Thousand Oaks, CA: Sage.
Matei, S., & Ball-Rokeach, S. (2001). Real and virtual social ties: Connections in the everyday lives of seven ethnic neighborhoods. American Behavioral Scientist, 45(3), 550-563.
Matzat, U. (2001). Social networks and cooperation in electronic communities. A theoretical-empirical study on academic communication and Internet discussion groups. Amsterdam: Thela Publisher.
McPherson, M., Smith-Lovin, L., & Cook, J. M. (2001). Birds of a feather: Homophily in social networks. Annual Review of Sociology, 27, 415-444.
Mettrop, W. & Nieuwenhuysen, P. (2001). Internet search engines - fluctuations in document accessibility. Journal of Documentation, 57(5), 623-651.
Milgram, S. (1967). The small world problem. Psychology Today, 1(1), 60-67.
Moed, H. (2002). The impact-factors debate: The ISI's uses and limits. Nature, 415, 731-732.
Monge, P., & Contractor, N. S. (2000). Emergence of communication networks. In Jablin, F. M., & Putnam, L. L. (Eds.), The new handbook of organizational communication: advances in theory, research, and methods (pp. 440-502). Thousand Oaks, CA: Sage.
Oppenheim, C. (1997). The correlation between citation counts and the 1992 research assessment exercise ratings for British research in genetics, anatomy and archaeology. Journal of Documentation, 53, 477-487.
Oppenheim, C. (2000). Do patent citations count? In Cronin, B. & Atkins, H. B. (Eds.), The Web of knowledge: A festschrift in honor of Eugene Garfield. Metford, NJ: Information Today Inc. ASIS Monograph Series, 405-432.
Palmer, J. W., Bailey, J. P., & Faraj, S. (2000). The role of intermediaries in the development of trust on the WWW: The use and prominence of trusted third parties and privacy statements. Journal of Computer-Mediated Communication, 5(3). Retrieved June 22, 2000 from http://www.ascusc.org/jcmc/vol5/issue3/palmer.html.
Paolillo, J. C. (2001). Language variation on Internet Relay Chat: A social network approach. Journal of Sociolinguistics, 5(2), 180-213.
Park, H. W. (in press). What is hyperlink network analysis?: New method for the study of social structure on the Web. Connections.
Park, H. W. (2002a). Examining the determinants of who is hyperlinked to whom: A survey of Webmasters in Korea. First Monday, 7(11). Retrieved November 19, 2002 from http://www.firstmonday.org/issues/issue7_11/park/index.html.
Park, H. W. (2002b, November). E-science and hyperlink network analysis: Collaborative communication through hyperlinking. Paper presented to the conference of the Netherlands School of Communications Research, Utrecht, Netherlands.
Park, H. W. (2002c, December). No diplomatic relationship between Korea and Taiwan on the Web? Looking at an enemy's Websites. Paper presented to the TKU (Tamkang University) 2002 International Communication Convention, Taipei, Taiwan.
Park, H. W., Barnett, G. A., & Kim, C. S. (2001). Internet communication structure in Korean National Assembly: A network analysis. Korean Journal of Journalism & Communication Studies (Special English edition), 185-204.
Park, H. W., Barnett, G. A., & Kim, C. S. (2000). Political communication structure in Internet networks - A Korean case. Sungkok Journalism Review, 11, 67-89.
Park, H. W., Barnett, G. A. & Nam, I. Y. (2002a). Hyperlink-affiliation network structure of top Web sites: Examining affiliates with hyperlink in Korea. Journal of the American Society for Information Science and Technology, 53(7), 592-601.
Park, H. W., Barnett, G. A., & Nam, I. Y. (2002b). Interorganizational hyperlink networks among websites in South Korea. NETCOM: Networks and communications studies, 16 (3/4, Special issue on the Internet development in Asia), 155-173.
Pirolli, P., & Card, S. K. (1999). Information foraging. Psychological Review, 106(4), 643-675.
Polanco, X., Boudourides, M. A., Besagni, D., & Roche, I. (2001). Clustering and mapping Web sites for displaying implicit associations and visualising networks. University of Patras.
Rice, R. E., & Barnett, G. (1986). Group communication networking in an information environment: Applying metric multidimensional scaling. In McLaughlin, M. (Ed.) Communication Yearbook, 9 (pp. 315-338). Beverly Hills, CA: Sage.
Rice, R. E. (1982). Communication networking in computer-conferencing systems: A longitudinal study of group roles and system structure. In Burgoon, M. (Ed.), Communication Yearbook, 6 (pp. 925-944). Beverly Hills, CA: Sage.
Rice, R. E. (1994). Network analysis and computer-mediated communication systems. In Wasserman, S., & Galaskiewicz, J. (Eds.), Advances in social network analysis (pp. 167-203). Thousand Oaks: Sage.
Richards, W. D. Jr. (1995). The NEGOPY network analysis program. Burnaby, BC: Department of Communication, Simon Fraser University.
Richards, W. D. Jr., & Barnett, G. A. (Eds.). (1993). Progress in communication science, 12. Norwood, NJ: Ablex.
Rodríguez Gairín, J. M. (1997). Valorando el impacto de la informacion en Internet: AltaVista, el "citation index" de la Red. Revista Espanola de Documentacion Cientifica, 20, 175-181. Retrieved January 3, 2003 from http://www.kronosdoc.com/publicacions/altavis.htm.
Rogers, E. M., & Bhowmik, D. K. (1971). Homophily-heterophily: Relational concepts for communication research. In Barker, L. L., & Kibler, R. J. (Eds.), Speech communication behavior: Perspectives and principles (pp. 206-225). Englewood Cliffs, N.J.: Prentice-Hall, Inc.
Rogers, E. M., & Kincaid, D. L. (1981). Communication networks: Toward a new paradigm for research. New York: Free Press.
Rogers, R., & Marres, N. (2000). Landscaping climate change: A mapping technique for understanding science and technology debates on the world wide web. Public Understanding of Science, 9, 141-163.
Rousseau, R. (1997). Sitations: An exploratory study. Cybermetrics, 1. Retrieved January 3, 2003 from http://www.cindoc.csic.es/cybermetrics/articles/v1i1p1.html.
Rousseau, R. (1999). Daily time series of common single word searches in AltaVista and NorthernLight. Cybermetrics, 2/3. Retrieved January 3, 2003 from http://www.cindoc.csic.es/cybermetrics/articles/v2i1p2.html.
Scharnhorst, A. (2003). Complex networks and the Web: Insights from non-linear physics. Journal of Computer-Mediated Communication, 8(4). Available: http://www.ascusc.org/jcmc/vol8/issue4/scharnhorst.html.
Scott, J. (1991). Social network analysis: A handbook. Thousand Oaks, CA: Sage.
Smith, A. & Thelwall, M. (2002). Web impact factors for Australasian universities. Scientometrics, 54(3), 363-380.
Smith, A. G. (1999a). A tale of two Web spaces: Comparing sites using Web impact factors. Journal of Documentation, 55(5), 577-592.
Smith, A. G. (1999b). The Impact of Web sites: A comparison between Australasia and Latin America. In Proceedings of INFO'99, Congreso Internacional de Informacion, Havana, 4-8 October 1999. Retrieved January 3, 2003 from http://www.vuw.ac.nz/~agsmith/publns/austlat/.
Smith, M. (1999c). Invisible crowds in cyberspace: Measuring and mapping the social structure of USENET. In Smith, M., & Kollock, P. (Eds.), Communities in cyberspace (pp. 195-219). London: Routledge.
Snyder, H., & Rosenbaum, H. (1999). Can search engines be used for Web-link analysis? A critical review. Journal of Documentation, 55(4), 375-384.
Soualmia, L.F., Darmoni, S.J., Le Duff, F., Douyère, M., & Thelwall, M. (2002). Web impact factor: A bibliometric criterion applied to medical informatics societies' Web sites. In Proceedings of MIE 2002, Seventeenth International Congress of the European Federation for Medical Informatics, Studies in Health Technology & Informatics, 90, 178-183.
Sunstein, C. R. (2001). Republic.com. Princeton, NJ: Princeton University Press.
Tang, R., & Thelwall, M. (2003, in press). Disciplinary differences in US academic departmental web site interlinking. Library and Information Science Research.
Terveen, L., & Hill, W. (1998, November). Evaluating emergent collaboration on the Web. Conference of Computer Supported Cooperative Work, Seattle, Washington.
Thelwall, M., & Harries, G. (2003). The connection between the research of a university and counts of links to its Web pages: An investigation based upon a classification of the relationships of pages to the research of the host university. Journal of the American Society for Information Science and Technology, 54(7), 594-602.
Thelwall, M., & Smith, A. (2002). A study of the interlinking between Asia-Pacific University Web sites. Scientometrics, 55(3), 363-376.
Thelwall, M., Tang, R. & Price, E. (2003). Linguistic patterns of academic Web use in western Europe. Scientometrics, 56(3), 417-432.
Thelwall, M., & Tang, R. (2003, in press). Disciplinary and linguistic considerations for academic Web linking: An exploratory hyperlink mediated study with Mainland China and Taiwan. Scientometrics.
Thelwall, M., & Wilkinson, D. (2003a). Three target document range metrics for university Web sites. Journal of the American Society for Information Science and Technology, 54(6), 489-496.
Thelwall, M., & Wilkinson, D. (2003b). Graph structure in three national academic Webs: Power laws with anomalies. Journal of the American Society for Information Science and Technology, 706-712.
Thelwall, M., & Wilkinson, D. (2004, in press). Finding similar academic Web sites with links, bibliometric couplings and colinks. Information Processing & Management.
Thelwall, M. (2000). Web impact factors and search engine coverage. Journal of Documentation, 56(2), 185-189.
Thelwall, M. (2001a). Extracting macroscopic information from Web links. Journal of the American Society for Information Science and Technology, 52 (13), 1157-1168.
Thelwall, M. (2001b). Exploring the link structure of the Web with network diagrams. Journal of Information Science, 27(6), 393-402.
Thelwall, M. (2001c). Commercial Web site links. Internet Research, 11(2), 114-124.
Thelwall, M. (2001d). Results from a Web Impact Factor crawler. Journal of Documentation, 57(2), 177-191.
Thelwall, M. (2001e). A publicly accessible database of UK university Website links and a discussion of the need for human intervention in Web crawling. Retrieved January 3, 2003 from http://www.scit.wlv.ac.uk/~cm1993/papers/a_publicly_accessible_database.pdf
Thelwall, M. (2001f). A Web crawler design for data mining. Journal of Information Science, 27(5), 319-325.
Thelwall, M. (2001g). The responsiveness of search engine indexes. Cybermetrics, 5(1). Retrieved January 3, 2003 from http://www.cindoc.csic.es/cybermetrics/articles/v5i1p1.html.
Thelwall, M. (2001h). Web log file analysis: Backlinks and queries. ASLIB Proceedings, 53(6), 217-223.
Thelwall, M. (2002a). The top 100 linked pages on UK university Web sites: High inlink counts are not usually directly associated with quality scholarly content. Journal of Information Science, 28(6), 485-493.
Thelwall, M. (2002b). A research and institutional size based model for national university Web site interlinking. Journal of Documentation, 58(6), 683-694.
Thelwall, M. (2002c). Evidence for the existence of geographic trends in university Web site interlinking. Journal of Documentation, 58(5), 563-574.
Thelwall, M. (2002d). Conceptualizing documentation on the Web: An evaluation of different heuristic-based models for counting links between university Web sites. Journal of the American Society for Information Science and Technology, 53(12), 995-1005.
Thelwall, M. (2002e). An initial exploration of the link relationship between UK university Web sites. ASLIB Proceedings, 54(2), 118-126.
Thelwall, M. (2002f). A comparison of sources of links for academic Web Impact Factor calculations. Journal of Documentation, 58(1), 60-72.
Thelwall, M. (2002g). What is this link doing here? Beginning a fine-grained process of identifying reasons for academic hyperlink creation. Information Research, 8(3), paper no. 151. Available at: http://informationr.net/ir/8-3/paper151.html.
Thelwall, M. (2002h). Methodologies for crawler based Web surveys. Internet Research: Electronic Networking and Applications, 12(2), 124-138.
Thelwall, M. (2002i). Research dissemination and invocation on the Web. Online Information Review, 26(6), 413-420.
Thelwall, M. (2002j). In praise of Google: Finding law journal Web sites. Online Information Review, 26(4), 271-272.
Thelwall, M. (2002k). Subject gateway sites and search engine ranking. Online Information Review, 26(2), 101-107.
Thelwall, M. (2003a, in press). Methods for reporting on the targets of links from national systems of university Web sites. Information Processing and Management.
Thelwall, M. (2003b). Web use and peer interconnectivity metrics for academic Web sites. Journal of Information Science, 29(1), 11-20.
Thelwall, M. (2003c). Can Google's PageRank be used to find the most important academic Web pages? Journal of Documentation, 59(2), 205-217.
Thelwall, M. (2003d, in press). A layered approach for investigating the topological structure of communities in the Web. Journal of Documentation, 59(3).
Thomas, O. & Willett, P. (2000). Webometric analysis of departments of librarianship and information science. Journal of Information Science, 26(6), 421-428.
Torgerson, W. S. (1958). Theory and methods of scaling. New York: Wiley.
Vaughan, L. & Thelwall, M. (2003). Scholarly use of the Web: What are the key inducers of links to journal Web sites? Journal of the American Society for Information Science and Technology, 54(1), 29-38.
Vaughan, L. Q. & Hysen, K. (2002). Do Web link counts resemble citation counts: An empirical examination. ASLIB Proceedings, 54(6), 356-361.
Wallerstein, I. (1976). The modern world system. New York: Academic Press.
Walsh, J. P., & Maloney, N. G. (2002). Computer network use, collaboration structures and productivity. In Hinds, P., & Kiesler, S. (Eds.), Distributed work. (pp. 433-458). Cambridge, MA: MIT Press. Retrieved June 10, 2002 from http://tigger.uic.edu/~jwalsh/Collab.html.
Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications. Cambridge, NY: Cambridge University Press.
Watts, D. J., & Strogatz, S. H. (1998). Collective dynamics of 'small-world' networks. Nature, 393, 440-442.
Weare, C., & Lin, W. Y. (2000). Content analysis of the World Wide Web: Opportunities and challenges. Social Science Computer Review, 18(3), 272-292.
Webb, E. J. (1966). Unobtrusive measures: Nonreactive research in the social sciences. Chicago: Rand McNally.
Wellman, B. (2001). Computer networks as social networks. Science, 293(14), 2031-2034.
Wellman, B., & Berkowitz, S. D. (1989). Social structures: A network approach. New York: Cambridge University Press.
Wilkinson, D., Harries, G., Thelwall, M., & Price, E. (2003). Motivations for academic Web site interlinking: Evidence for the Web as a novel source of information on informal scholarly communication. Journal of Information Science, 29(1), 59-66.
Wouters, P. F. (1999). The citation culture. Ph. D. thesis. University of Amsterdam.
Wouters, P. F., & Gerber, D. (2003). Interactive Internet? Studying mediated interaction with publicly available search engines. Journal of Computer-Mediated Communication, 8(4). Available: http://www.ascusc.org/jcmc/vol8/issue4/wouters.html.

