Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Skip header Section
Information retrieval: data structures and algorithmsJune 1992
Publisher:
  • Prentice-Hall, Inc.
  • Division of Simon and Schuster One Lake Street Upper Saddle River, NJ
  • United States
ISBN:978-0-13-463837-9
Published:01 June 1992
Pages:
497
Skip Bibliometrics Section
Bibliometrics
Abstract

No abstract available.

Skip Table Of Content Section
chapter
Inverted files
pp 28–43
chapter
Signature files
pp 44–65
chapter
Lexical analysis and stoplists
pp 102–130
chapter
Stemming algorithms
pp 131–160
chapter
Thesaurus construction
pp 161–218
chapter
String searching algorithms
pp 219–240
chapter
Boolean operations
pp 264–292
chapter
chapter
Ranking algorithms
pp 363–392
chapter
Extended Boolean models
pp 393–418
chapter
Clustering algorithms
pp 419–442
chapter
chapter

Cited By

  1. Noel R, Panach J and Pastor O (2023). Including business strategy in model-driven methods: an experiment, Requirements Engineering, 28:3, (411-440), Online publication date: 1-Sep-2023.
  2. Parish Z, Cushing C, Aggarwal S, Salehi-Abari A and Thorpe J (2022). Password guessers under a microscope: an in-depth analysis to inform deployments, International Journal of Information Security, 21:2, (409-425), Online publication date: 1-Apr-2022.
  3. ACM
    Song S, Huang R, Gao Y and Wang J Why Not Match: On Explanations of Event Pattern Queries Proceedings of the 2021 International Conference on Management of Data, (1705-1717)
  4. Ghahramani F, Tahayori H and Visconti A (2021). Effects of central tendency measures on term weighting in textual information retrieval, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 25:11, (7341-7378), Online publication date: 1-Jun-2021.
  5. Shambour Q, Turab N and Adwan O (2021). An Effective e-Commerce Recommender System Based on Trust and Semantic Information, Cybernetics and Information Technologies, 21:1, (103-118), Online publication date: 1-Mar-2021.
  6. ACM
    El-Ansari A, Beni-Hssane A and Saadi M An improved modeling method for profile-based personalized search Proceedings of the 3rd International Conference on Networking, Information Systems & Security, (1-6)
  7. Yurochkin M, Claici S, Chien E, Mirzazadeh F and Solomon J Hierarchical optimal transport for document representation Proceedings of the 33rd International Conference on Neural Information Processing Systems, (1601-1611)
  8. ACM
    Lovering C, Lu A, Nguyen C, Nguyen H, Hurley D and Agu E (2018). Fact or Fiction, Proceedings of the ACM on Human-Computer Interaction, 2:CSCW, (1-15), Online publication date: 1-Nov-2018.
  9. Antoniol G, Ayari K, Di Penta M, Khomh F and Guéhéneuc Y Is it a bug or an enhancement? Proceedings of the 28th Annual International Conference on Computer Science and Software Engineering, (2-16)
  10. ACM
    Abrahão S, Insfran E, de Guevara F, Fernández-Diego M, Cano-Genoves C and de Oliveira R Comparing the effectiveness of goal-oriented languages Proceedings of the 12th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, (1-4)
  11. ACM
    Canfora G, Di Sorbo A, Emanuele E, Forootani S and Visaggio C A Nlp-based Solution to Prevent from Privacy Leaks in Social Network Posts Proceedings of the 13th International Conference on Availability, Reliability and Security, (1-6)
  12. ACM
    Lee J, Kim D, Bissyandé T, Jung W and Le Traon Y Bench4BL: reproducibility study on the performance of IR-based bug localization Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, (61-72)
  13. Al-Obeidallah M, Petridis M and Kapetanakis S (2018). A Multiple Phases Approach for Design Patterns Recovery Based on Structural and Method Signature Features, International Journal of Software Innovation, 6:3, (36-52), Online publication date: 1-Jul-2018.
  14. Fan Q, Yu Y, Yin G, Wang T and Wang H Where is the road for issue reports classification based on text mining? Proceedings of the 11th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, (121-130)
  15. ACM
    Jovanovska J, Bozhinova I and Zdravkova K Information Retrieval with Reinforced Word Classes Proceedings of the 8th Balkan Conference in Informatics, (1-8)
  16. Gopalakrishnan R, Sharma P, Mirakhorli M and Galster M Can latent topics in source code predict missing architectural tactics? Proceedings of the 39th International Conference on Software Engineering, (15-26)
  17. Beheshti S, Benatallah B, Venugopal S, Ryu S, Motahari-Nezhad H and Wang W (2017). A systematic review and comparative analysis of cross-document coreference resolution methods and tools, Computing, 99:4, (313-349), Online publication date: 1-Apr-2017.
  18. Morstatter F and Liu H (2017). In search of coherence and consensus, The Journal of Machine Learning Research, 18:1, (6177-6208), Online publication date: 1-Jan-2017.
  19. ACM
    Panichella S, Di Sorbo A, Guzman E, Visaggio C, Canfora G and Gall H ARdoc: app reviews development oriented classifier Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, (1023-1027)
  20. Sharma A, Sharma A, Deodhare D, Chakraborti S, Kumar P and Mitra P Case Representation and Retrieval Techniques for Neuroanatomical Connectivity Extraction from PubMed Case-Based Reasoning Research and Development, (370-386)
  21. Roy S, Muni D, Tack Yan J, Budhiraja N and Ceiler F Clustering and Labeling IT Maintenance Tickets Service-Oriented Computing, (829-845)
  22. Wang H, Kessentini M and Ouni A Bi-level Identification of Web Service Defects Service-Oriented Computing, (352-368)
  23. Colvin E and Kraft D (2016). Fuzzy retrieval for software reuse, Journal of the Association for Information Science and Technology, 67:10, (2454-2463), Online publication date: 1-Oct-2016.
  24. ACM
    Ferragina P and Venturini R (2016). Compressed Cache-Oblivious String B-Tree, ACM Transactions on Algorithms, 12:4, (1-17), Online publication date: 2-Sep-2016.
  25. (2016). The added value of auxiliary data in sentiment analysis of Facebook posts, Decision Support Systems, 89:C, (98-112), Online publication date: 1-Sep-2016.
  26. Li H, Guan Y, Liu L, Wang F and Wang L (2016). Re-ranking for microblog retrieval via multiple graph model, Multimedia Tools and Applications, 75:15, (8939-8954), Online publication date: 1-Aug-2016.
  27. References Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  28. Appendixes Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  29. Toward A Unified System for Text Management and Analysis Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  30. Joint Analysis of Text and Structured Data Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  31. Opinion Mining and Sentiment Analysis Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  32. Topic Analysis Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  33. Text Summarization Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  34. Text Categorization Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  35. Text Clustering Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  36. Word Association Mining Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  37. Overview of Text Data Analysis Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  38. Recommender Systems Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  39. Web Search Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  40. Search Engine Evaluation Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  41. Search Engine Implementation Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  42. Feedback Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  43. Retrieval Models Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  44. Overview of Text Data Access Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  45. MeTA : A Unified Toolkit for Text Data Management and Analysis Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  46. Text Data Understanding Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  47. Background Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  48. Introduction Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  49. Preface Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
  50. ACM
    Anish P, Balasubramaniam B, Sainani A, Cleland-Huang J, Daneva M, Wieringa R and Ghaisas S Probing for requirements knowledge to stimulate architectural thinking Proceedings of the 38th International Conference on Software Engineering, (843-854)
  51. Di Sorbo A, Panichella S, Visaggio C, Di Penta M, Canfora G and Gall H Development emails content analyzer Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, (12-23)
  52. Colvin E and Kraft D Fuzzy retrieval for software reuse Proceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community, (1-3)
  53. ACM
    Hao S, Zhao P, Hoi S and Miao C Learning Relative Similarity from Data Streams Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, (1181-1190)
  54. ACM
    Silva G, Ferreira R, Lins R, Cabral L, Oliveira H, Simske S and Riss M Automatic Text Document Summarization Based on Machine Learning Proceedings of the 2015 ACM Symposium on Document Engineering, (191-194)
  55. ACM
    Wang L, Tasoulis S, Roos T and Kangasharju J Kvasir Proceedings of the 24th International Conference on World Wide Web, (251-254)
  56. ACM
    Baba S, Toriumi F, Sakaki T, Shinoda K, Kurihara S, Kazama K and Noda I Classification Method for Shared Information on Twitter Without Text Data Proceedings of the 24th International Conference on World Wide Web, (1173-1178)
  57. Chakrabarti A and Parthasarathy S Sequential Hypothesis Tests for Adaptive Locality Sensitive Hashing Proceedings of the 24th International Conference on World Wide Web, (162-172)
  58. ACM
    Doherty J, Curran K and McKevitt P (2015). Pattern Matching Techniques for Replacing Missing Sections of Audio Streamed across Wireless Networks, ACM Transactions on Intelligent Systems and Technology, 6:2, (1-38), Online publication date: 4-May-2015.
  59. Amir A, Apostolico A, Landau G, Porat E and Sar Shalom O (2015). A PTAS for the Square Tiling Problem, Theoretical Computer Science, 562:C, (33-45), Online publication date: 11-Jan-2015.
  60. ACM
    Anh N, Toi N and Linh N An interpretable method for text summarization based on simplicial non-negative matrix factorization Proceedings of the 5th Symposium on Information and Communication Technology, (57-64)
  61. ACM
    Singh A and Rana A Generate frequent queries for Views in a Data Warehouse using Data Mining Techniques Proceedings of the 2014 International Conference on Information and Communication Technology for Competitive Strategies, (1-6)
  62. Gilbert E and Karahalios K (2014). Computing and Building Around Tie Strength in Social Media, Foundations and Trends in Human-Computer Interaction, 7:3, (237-349), Online publication date: 1-Oct-2014.
  63. Akbar M, Shaffer C, Fan W and Fox E Recommendation based on deduced social networks in an educational digital library Proceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries, (29-38)
  64. ACM
    Djellali C A new conceptual model for dynamic text clustering Using unstructured text as a case Proceedings of the 2014 International C* Conference on Computer Science & Software Engineering, (1-7)
  65. ACM
    Sulaiman S, Omar K, Omar N, Murah M and Rahman H (2014). The Effectiveness of a Jawi Stemmer for Retrieving Relevant Malay Documents in Jawi Characters, ACM Transactions on Asian Language Information Processing, 13:2, (1-21), Online publication date: 1-Jun-2014.
  66. Parra-Arnau J, Rebollo-Monedero D and Forné J (2014). Measuring the privacy of user profiles in personalized information systems, Future Generation Computer Systems, 33, (53-63), Online publication date: 1-Apr-2014.
  67. Köse G, Tonta Y, Ahmadlouei H and Polatkan A Story Link Detection in Turkish Corpus Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 01, (154-158)
  68. Bello G, Menéndez H, Okazaki S and Camacho D Extracting Collective Trends from Twitter Using Social-Based Data Mining Proceedings of the 5th International Conference on Computational Collective Intelligence. Technologies and Applications - Volume 8083, (622-630)
  69. ACM
    Bello-Orgaz G and Camacho D Comparative study of text clustering techniques in virtual worlds Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics, (1-8)
  70. ACM
    Toriumi F, Sakaki T, Shinoda K, Kazama K, Kurihara S and Noda I Information sharing on Twitter during the 2011 catastrophic earthquake Proceedings of the 22nd International Conference on World Wide Web, (1025-1028)
  71. Choi H, Jung H, Lee K and Chung Y (2013). Skyline queries on keyword-matched data, Information Sciences: an International Journal, 232, (449-463), Online publication date: 1-May-2013.
  72. Delater A and Paech B Analyzing the tracing of requirements and source code during software development Proceedings of the 19th international conference on Requirements Engineering: Foundation for Software Quality, (308-314)
  73. Parra-Arnau J, Rebollo-Monedero D, Forné J, MuñOz J and Esparza O (2012). Optimal tag suppression for privacy protection in the semantic Web, Data & Knowledge Engineering, 81-82, (46-66), Online publication date: 1-Nov-2012.
  74. ACM
    Wong R, Shi F and Lam N Full-text search on multi-byte encoded documents Proceedings of the 2012 ACM symposium on Document engineering, (227-236)
  75. Ching W, Chu D, Liao L and Wang X (2012). Regularized orthogonal linear discriminant analysis, Pattern Recognition, 45:7, (2719-2732), Online publication date: 1-Jul-2012.
  76. Tovar M, Reyes J, Montes A, Vilariño D, Pinto D and León S BUAP Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, (502-505)
  77. ACM
    Gilbert E Designing social translucence over social networks Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, (2731-2740)
  78. Vlas R and Robinson W (2012). Two Rule-Based Natural Language Strategies for Requirements Discovery and Classification in Open Source Software Development Projects, Journal of Management Information Systems, 28:4, (11-38), Online publication date: 1-Apr-2012.
  79. ACM
    Ferreira R, Lima R, Melo J, Costa E, Freitas F and Pacca H RetriBlog Proceedings of the 27th Annual ACM Symposium on Applied Computing, (696-701)
  80. ACM
    Zanker M, Jessenitschnig M and Stromberger M Harnessing geo-tagged resources for Web personalization Proceedings of the 27th Annual ACM Symposium on Applied Computing, (332-339)
  81. Goyal P and Mehala N Concept based query recommendation Proceedings of the Ninth Australasian Data Mining Conference - Volume 121, (69-78)
  82. Hose K, Schenkel R, Theobald M and Weikum G Database foundations for scalable RDF processing Proceedings of the 7th international conference on Reasoning web: semantic technologies for the web of data, (202-249)
  83. Chahine C, Chaignaud N, Kotowicz J and Pecuchet J Conceptual Indexing of Documents Using Wikipedia Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01, (195-202)
  84. Shih C, Chen M, Chu H and Chen Y (2011). Enhancement of domain ontology construction using a crystallizing approach, Expert Systems with Applications: An International Journal, 38:6, (7544-7557), Online publication date: 1-Jun-2011.
  85. ACM
    Miyata A and Fujimura K Document area identification for extending books without markers Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, (3189-3198)
  86. Mitankin P, Mihov S and Schulz K (2011). Deciding word neighborhood with universal neighborhood automata, Theoretical Computer Science, 412:22, (2340-2355), Online publication date: 1-May-2011.
  87. Ma Y, Chung C and Chen T (2011). Load and storage balanced posting file partitioning for parallel information retrieval, Journal of Systems and Software, 84:5, (864-884), Online publication date: 1-May-2011.
  88. Cao Y, Liu F, Simpson P, Antieau L, Bennett A, Cimino J, Ely J and Yu H (2011). AskHERMES, Journal of Biomedical Informatics, 44:2, (277-288), Online publication date: 1-Apr-2011.
  89. ACM
    Terrovitis M, Bouros P, Vassiliadis P, Sellis T and Mamoulis N Efficient answering of set containment queries for skewed item distributions Proceedings of the 14th International Conference on Extending Database Technology, (225-236)
  90. ACM
    Park S, An D and Yoo H Document clustering using NMF and fuzzy relation Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, (1-5)
  91. ACM
    Wiesner M and Pfeifer D Adapting recommender systems to the requirements of personal health record systems Proceedings of the 1st ACM International Health Informatics Symposium, (410-414)
  92. Ceglarek D, Haniewicz K and Rutkowski W Quality of semantic compression in classification Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume PartI, (162-171)
  93. Bu F, Zhu X, Hao Y and Zhu X Function-based question classification for general QA Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, (1119-1128)
  94. Vanetti M, Binaghi E, Carminati B, Carullo M and Ferrari E Content-based filtering in on-line social networks Proceedings of the international ECML/PKDD conference on Privacy and security issues in data mining and machine learning, (127-140)
  95. Loponen A and Järvelin K A dictionary- and corpus-independent statistical lemmatizer for information retrieval in low resource languages Proceedings of the 2010 international conference on Multilingual and multimodal information access evaluation: cross-language evaluation forum, (3-14)
  96. ACM
    Torchiano M and Ricca F Impact analysis by means of unstructured knowledge in the context of bug repositories Proceedings of the 2010 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, (1-4)
  97. ACM
    Ricca F, Scanniello G, Torchiano M, Reggio G and Astesiano E On the effectiveness of screen mockups in requirements engineering Proceedings of the 2010 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, (1-10)
  98. Halabi A, Islim A and Kurdi M A hybrid approach for indexing and retrieval of archaeological textual information Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part IV, (527-535)
  99. Parra-Arnau J, Rebollo-Monedero D and Forné J A privacy-preserving architecture for the semantic web based on tag suppression Proceedings of the 7th international conference on Trust, privacy and security in digital business, (58-68)
  100. ACM
    Zhang Q, Zhang Y, Yu H and Huang X Efficient partial-duplicate detection based on sequence matching Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, (675-682)
  101. Chen B, Foster G and Kuhn R Bilingual sense similarity for statistical machine translation Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, (834-843)
  102. Sun L, Versteeg S, Boztaş S and Yann T Pattern recognition techniques for the classification of malware packers Proceedings of the 15th Australasian conference on Information security and privacy, (370-390)
  103. ACM
    Di Penta M, German D, Guéhéneuc Y and Antoniol G An exploratory study of the evolution of software licensing Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1, (145-154)
  104. Moha N, Guéhéneuc Y, Meur A, Duchien L and Tiberghien A (2010). From a domain analysis to the specification and detection of code and design smells, Formal Aspects of Computing, 22:3, (345-361), Online publication date: 1-May-2010.
  105. Chu D and Thye G (2010). A new and fast implementation for null space based linear discriminant analysis, Pattern Recognition, 43:4, (1373-1379), Online publication date: 1-Apr-2010.
  106. ACM
    Tan C, Sheng B, Wang H and Li Q (2010). Microsearch, ACM Transactions on Embedded Computing Systems, 9:4, (1-29), Online publication date: 1-Mar-2010.
  107. ACM
    Gilbert E and Karahalios K Understanding deja reviewers Proceedings of the 2010 ACM conference on Computer supported cooperative work, (225-228)
  108. Crochemore M and Hancart C Pattern matching in strings Algorithms and theory of computation handbook, (13-13)
  109. Larsen J, Halling S, Sigurðsson M and Hansen L MuZeeker Mobile Multimedia Processing, (154-169)
  110. Ji J, Chan T and Zhao Q Fast document clustering based on weighted comparative advantage Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics, (541-546)
  111. ACM
    Cleland-Huang J, Dumitru H, Duan C and Castro-Herrera C (2009). Automated support for managing feature requests in open forums, Communications of the ACM, 52:10, (68-74), Online publication date: 1-Oct-2009.
  112. Jiang L, Wu Z, Zheng Q and Liu J Learning Deep Web Crawling with Diverse Features Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, (572-575)
  113. Swaminathan A, Mathew C and Kirovski D Essential Pages Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, (173-182)
  114. Wang X, Lai G and Liu C (2009). Recovering Relationships between Documentation and Source Code based on the Characteristics of Software Engineering, Electronic Notes in Theoretical Computer Science (ENTCS), 243, (121-137), Online publication date: 1-Jul-2009.
  115. Stamou S, Kozanidis L, Tzekou P and Zotos N (2009). Ontology-driven personalized query refinement, Journal of Web Engineering, 8:2, (113-153), Online publication date: 1-Jun-2009.
  116. ACM
    Gilbert E and Karahalios K Predicting tie strength with social media Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, (211-220)
  117. ACM
    Shao P and Smith R Feature location by IR modules and call graph Proceedings of the 47th Annual Southeast Regional Conference, (1-4)
  118. ACM
    Jeong O and Lee S An efficient clustering framework for relevant web information Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication, (450-456)
  119. ACM
    Chen C, Chen M and Chen M (2009). An adaptive threshold framework for event detection using HMM-based life profiles, ACM Transactions on Information Systems, 27:2, (1-35), Online publication date: 1-Feb-2009.
  120. Dehghan S and Rahmani A A Classifier-CMAC Neural Network Model for Web Mining Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, (427-431)
  121. Formica A, Missikoff M, Pourabbas E and Taglino F Weighted Ontology for Semantic Search Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems, (1289-1303)
  122. ACM
    Haruechaiyasak C, Kongyoung S and Damrongrat C LearnLexTo Proceedings of the 2nd ACM workshop on Improving non english web searching, (85-88)
  123. ACM
    Antoniol G, Ayari K, Di Penta M, Khomh F and Guéhéneuc Y Is it a bug or an enhancement? Proceedings of the 2008 conference of the center for advanced studies on collaborative research: meeting of minds, (304-318)
  124. ACM
    Parikh N and Sundaresan N Inferring semantic query relations from collective user behavior Proceedings of the 17th ACM conference on Information and knowledge management, (349-358)
  125. ACM
    Zanker M A collaborative constraint-based meta-level recommender Proceedings of the 2008 ACM conference on Recommender systems, (139-146)
  126. ACM
    Pandey A and Siddiqui T An unsupervised Hindi stemmer with heuristic improvements Proceedings of the second workshop on Analytics for noisy unstructured text data, (99-105)
  127. ACM
    Guo J, Xu G, Li H and Cheng X A unified and discriminative model for query refinement Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, (379-386)
  128. Chowdary C and Sreenivasa Kumar P Sentence Ordering for Coherent Multi-document Summary Generation Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge, (40-50)
  129. Chang Y, Wang J and Lalmas M Generation of Query-Biased Concepts Using Content and Structure for Query Reformulation Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems, (136-141)
  130. Elsayed T, Lin J and Oard D Pairwise document similarity in large collections with MapReduce Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, (265-268)
  131. Moha N, Guéhéneuc Y, Le Meur A and Duchien L A domain analysis to specify design defects and generate detection algorithms Proceedings of the Theory and practice of software, 11th international conference on Fundamental approaches to software engineering, (276-291)
  132. ACM
    Eltabakh M, Hon W, Shah R, Aref W and Vitter J The SBC-tree Proceedings of the 11th international conference on Extending database technology: Advances in database technology, (523-534)
  133. Jee H, Lee J and Hong D High speed search for large-scale digital forensic investigation Proceedings of the 1st international conference on Forensic applications and techniques in telecommunications, information, and multimedia and workshop, (1-4)
  134. Yang C, Chen I, Hung C and Wu P Improving hierarchical taxonomy integration with semantic feature expansion on category-specific terms Proceedings of the 4th Asia information retrieval conference on Information retrieval technology, (225-236)
  135. Vitter J (2008). Algorithms and data structures for external memory, Foundations and Trends® in Theoretical Computer Science, 2:4, (305-474), Online publication date: 1-Jan-2008.
  136. Tummarello G, Delbru R and Oren E Sindice.com Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference, (552-565)
  137. Tummarello G, Delbru R and Oren E Sindice.com: Weaving the Open Linked Data The Semantic Web, (552-565)
  138. ACM
    Daoud A Effective ranked conceptual retrieval Proceedings of the ACM first workshop on CyberInfrastructure: information management in eScience, (55-60)
  139. ACM
    Duan C and Cleland-Huang J Clustering support for automated tracing Proceedings of the 22nd IEEE/ACM International Conference on Automated Software Engineering, (244-253)
  140. Ayari K, Meshkinfam P, Antoniol G and Di Penta M Threats on building models from CVS and Bugzilla repositories Proceedings of the 2007 conference of the center for advanced studies on Collaborative research, (215-228)
  141. Veilumuthu A and Ramachandran P Discovering implicit feedbacks from search engine log files Proceedings of the 10th international conference on Discovery science, (231-242)
  142. Barroso N, Ezeiza A, Gilisagasti N, de Ipiña K, López A and López J First approach in the development of multimedia information retrieval resources for the Basque context Proceedings of the 10th international conference on Text, speech and dialogue, (582-590)
  143. Du X, Song W and Zhang M A context-based framework and method for learning object description and search Proceedings of the 6th international conference on Advances in web based learning, (114-125)
  144. ACM
    Brezeale D and Cook D Learning video preferences from video content Proceedings of the 8th international workshop on Multimedia data mining: (associated with the ACM SIGKDD 2007), (1-9)
  145. Chen Z and Fu B (2007). On the complexity of Rocchio's similarity-based relevance feedback algorithm, Journal of the American Society for Information Science and Technology, 58:10, (1392-1400), Online publication date: 1-Aug-2007.
  146. Pinto D, Rosso P and Jiménez-Salazar H UPV-SI Proceedings of the 4th International Workshop on Semantic Evaluations, (430-433)
  147. ACM
    Fallen C and Newby G Distributed web search efficiency by truncating results Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries, (195-203)
  148. Ma W, Wang W and Liu J Scalable keyword search based on semantic in DHT based peer-to-peer system Proceedings of the 2nd international conference on Scalable information systems, (1-4)
  149. ACM
    Mazeika A, Böhlen M, Koudas N and Srivastava D (2007). Estimating the selectivity of approximate string queries, ACM Transactions on Database Systems, 32:2, (12-es), Online publication date: 1-Jun-2007.
  150. Ricca F, Di Penta M, Torchiano M, Tonella P and Ceccato M The Role of Experience and Ability in Comprehension Tasks Supported by UML Stereotypes Proceedings of the 29th international conference on Software Engineering, (375-384)
  151. Kanongchaiyos P Calculation of document similarity using cellular structured space template Proceedings of the third conference on IASTED International Conference: Advances in Computer Science and Technology, (311-316)
  152. Cleland-Huang J, Settimi R, Zou X and Solc P (2007). Automated classification of non-functional requirements , Requirements Engineering, 12:2, (103-120), Online publication date: 1-Apr-2007.
  153. Zhou X and Yu H A clustering-based approach for tracing object-oriented design to requirement Proceedings of the 10th international conference on Fundamental approaches to software engineering, (412-422)
  154. ACM
    Lee J, Oh J, Shah S, Yuan X and Tang S Automatic classification of digestive organs in wireless capsule endoscopy videos Proceedings of the 2007 ACM symposium on Applied computing, (1041-1045)
  155. ACM
    Lamprier S, Amghar T, Levrat B and Saubion F ClassStruggle Proceedings of the 2007 ACM symposium on Applied computing, (600-604)
  156. Braschler M and Ferro N Adding multilingual information access to the European library Proceedings of the 1st international conference on Digital libraries: research and development, (218-227)
  157. Schickel-Zuber V and Faltings B OSS Proceedings of the 20th international joint conference on Artifical intelligence, (551-556)
  158. Yang K and Shahabi C (2007). An efficient k nearest neighbor search for multivariate time series, Information and Computation, 205:1, (65-98), Online publication date: 1-Jan-2007.
  159. Melton G, Parsons S, Morrison F, Rothschild A, Markatou M and Hripcsak G (2006). Inter-patient distance metrics using SNOMED CT defining relationships, Journal of Biomedical Informatics, 39:6, (697-705), Online publication date: 1-Dec-2006.
  160. ACM
    Becerra-Fernandez I (2006). Searching for experts on the Web, ACM Transactions on Internet Technology, 6:4, (333-355), Online publication date: 1-Nov-2006.
  161. Aasheim C and Koehler G (2006). Scanning world wide web documents with the vector space model, Decision Support Systems, 42:2, (690-699), Online publication date: 1-Nov-2006.
  162. Ferri F, Formica A, Grifoni P and Rafanelli M Query approximation by semantic similarity in GeoPQL Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part II, (1670-1680)
  163. Yang K, Yu N, Zhang H, Akram S and Record I WIDIT Proceedings of the Third Asia conference on Information Retrieval Technology, (649-658)
  164. Ho J, Chen I and Yang C Learning to integrate web catalogs with conceptual relationships in hierarchical thesaurus Proceedings of the Third Asia conference on Information Retrieval Technology, (217-229)
  165. Farah M and Vanderpooten D A multiple criteria approach for information retrieval Proceedings of the 13th international conference on String Processing and Information Retrieval, (242-254)
  166. Ko M and Lee Y Reserve price recommendation by similarity-based time series analysis for internet auction systems Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I, (292-299)
  167. Chen Y, Wei J, Wu S and Hu Y (2006). A similarity-based method for retrieving documents from the SCI/SSCI database, Journal of Information Science, 32:5, (449-464), Online publication date: 1-Oct-2006.
  168. Zanker M, Gordea S, Jessenitschnig M and Schnabl M A hybrid similarity concept for browsing semi-structured product items Proceedings of the 7th international conference on E-Commerce and Web Technologies, (21-30)
  169. Zhang X, Li B, Mu W and Liu Y Large quantity of text classification based on the improved feature-line method Proceedings of the 9th Pacific Rim international conference on Artificial intelligence, (515-523)
  170. Godoy D and Amandi A (2006). A conceptual clustering approach for user profiling in personal information agents, AI Communications, 19:3, (207-227), Online publication date: 1-Aug-2006.
  171. ACM
    Zobel J and Moffat A (2006). Inverted files for text search engines, ACM Computing Surveys, 38:2, (6-es), Online publication date: 25-Jul-2006.
  172. Yao J, Zhao S and Fan L An enhanced support vector machine model for intrusion detection Proceedings of the First international conference on Rough Sets and Knowledge Technology, (538-543)
  173. Méndez J, Fdez-Riverola F, Díaz F, Iglesias E and Corchado J A comparative performance study of feature selection methods for the anti-spam filtering domain Proceedings of the 6th Industrial Conference on Data Mining conference on Advances in Data Mining: applications in Medicine, Web Mining, Marketing, Image and Signal Mining, (106-120)
  174. Cambazoglu B and Aykanat C (2006). Performance of query processing implementations in ranking-based text retrieval systems using inverted indices, Information Processing and Management: an International Journal, 42:4, (875-898), Online publication date: 1-Jul-2006.
  175. Chang Y, Kim M and Raghavan V (2006). Construction of query concepts based on feature clustering of documents, Information Retrieval, 9:3, (231-248), Online publication date: 1-Jun-2006.
  176. Ponnusamy R, Gopal T and Vaidyanathan S A Hierarchical Concept-matrix Patterned Multi-Agent Based Automated Text Classification Method for Digital Libraries Proceedings of the 2006 conference on Advances in Intelligent IT: Active Media Technology 2006, (372-379)
  177. Gonzalez M, de Lima V and de Lima J Tools for nominalization Proceedings of the 7th international conference on Computational Processing of the Portuguese Language, (100-109)
  178. Woo Y, Nam D, Hur T, Park Y, Huh W, Woo Y and Min H Automated keyword extraction using category correlation of data Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part II, (224-230)
  179. Ernst-Gerlach A and Fuhr N Generating search term variants for text collections with historic spellings Proceedings of the 28th European conference on Advances in Information Retrieval, (49-60)
  180. Gonzalez M, de Lima V and de Lima J Lexical normalization and relationship alternatives for a term dependence model in information retrieval Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing, (394-405)
  181. Joo K and Lee S An incremental document clustering algorithm based on a hierarchical agglomerative approach Proceedings of the Second international conference on Distributed Computing and Internet Technology, (321-332)
  182. Chen Z and Fu B On the complexity of rocchio's similarity-based relevance feedback algorithm Proceedings of the 16th international conference on Algorithms and Computation, (216-225)
  183. Alvares R, Garcia A and Ferraz I STEMBR Proceedings of the 12th Portuguese conference on Progress in Artificial Intelligence, (693-701)
  184. Yang K and Shahabi C On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis Proceedings of the Fifth IEEE International Conference on Data Mining, (805-808)
  185. ACM
    Stirewalt R, Deng M and Cheng B UML formalization is a traceability problem Proceedings of the 3rd international workshop on Traceability in emerging forms of software engineering, (31-36)
  186. de Madariaga R, del Castillo J and Hilera J A generalization of the method for evaluation of stemming algorithms based on error counting Proceedings of the 12th international conference on String Processing and Information Retrieval, (228-233)
  187. Lloyd L, Kechagias D and Skiena S Lydia Proceedings of the 12th international conference on String Processing and Information Retrieval, (161-166)
  188. Ferri F, Formica A, Grifoni P and Rafanelli M Evaluating semantic similarity using GML in geographic information systems Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems, (1009-1019)
  189. Chen I, Ho J and Yang C An iterative approach for web catalog integration with support vector machines Proceedings of the Second Asia conference on Asia Information Retrieval Technology, (703-708)
  190. Joo K and Lee W An incremental document clustering for the large document database Proceedings of the Second Asia conference on Asia Information Retrieval Technology, (374-387)
  191. Yang K and Yu N WIDIT Proceedings of the Second Asia conference on Asia Information Retrieval Technology, (206-220)
  192. Martín-Valdivia M, Martínez-Santiago F and Ureña-López L (2005). Merging Strategy for Cross-Lingual Information Retrieval Systems based on Learning Vector Quantization, Neural Processing Letters, 22:2, (149-161), Online publication date: 1-Oct-2005.
  193. Morita K, Atlam E, Fuketa M, Ghada E, Oono M, Sumitomo T and Aoe J New approach for speeding-up technique of the retrieval using dynamic full-text search algorithm Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part IV, (619-625)
  194. Liu Z and Chen D Classification of chromosome sequences with entropy kernel and LKPLS algorithm Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part I, (543-551)
  195. Kim H and Chan P Personalized search results with user interest hierarchies learnt from bookmarks Proceedings of the 7th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis, (158-176)
  196. Zhou L and Hovy E Digesting virtual "geek" culture Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, (298-305)
  197. ACM
    Han H, Zha H and Giles C Name disambiguation in author citations using a K-way spectral clustering method Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries, (334-343)
  198. Jung S, Hong J and Kim T (2005). A Statistical Model for User Preference, IEEE Transactions on Knowledge and Data Engineering, 17:6, (834-843), Online publication date: 1-Jun-2005.
  199. ACM
    Cleland-Huang J, Settimi R, BenKhadra O, Berezhanskaya E and Christina S Goal-centric traceability for managing non-functional requirements Proceedings of the 27th international conference on Software engineering, (362-371)
  200. Cantone D, Ferro A, Giugno R, Presti G and Pulvirenti A Multiple-Winners randomized tournaments with consensus for optimization problems in generic metric spaces Proceedings of the 4th international conference on Experimental and Efficient Algorithms, (265-276)
  201. Kim G and Han J Thesaurus contruction using class inheritance Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part III, (748-757)
  202. Guo L, Shanmugasundaram J, Beyer K and Shekita E Efficient Inverted Lists and Query Algorithms for Structured Value Ranking in Update-Intensive Relational Databases Proceedings of the 21st International Conference on Data Engineering, (298-309)
  203. ACM
    Huang H, Shen L, Makedon F, Zhang S, Greenberg M, Gao L and Pearlman J A clustering-based approach for prediction of cardiac resynchronization therapy Proceedings of the 2005 ACM symposium on Applied computing, (260-266)
  204. Díaz I, Llorens J, Genova G and Fuentes J (2005). Generating domain representations using a relationship model, Information Systems, 30:1, (1-19), Online publication date: 1-Mar-2005.
  205. Mladenić D Feature selection for dimensionality reduction Proceedings of the 2005 international conference on Subspace, Latent Structure and Feature Selection, (84-102)
  206. Badr Y and Chbeir R Automatic image description based on textual data Journal on Data Semantics VII, (196-218)
  207. Chollet G, McTait K and Petrovska-Delacrétaz D Data driven approaches to speech and language processing Nonlinear Speech Modeling and Applications, (164-198)
  208. Anh V and Moffat A (2005). Inverted Index Compression Using Word-Aligned Binary Codes, Information Retrieval, 8:1, (151-166), Online publication date: 1-Jan-2005.
  209. ACM
    Yang K and Shahabi C A PCA-based similarity measure for multivariate time series Proceedings of the 2nd ACM international workshop on Multimedia databases, (65-74)
  210. ACM
    Tomita J, Nakawatase H and Ishii M Calculating similarity between texts using graph-based text representation model Proceedings of the thirteenth ACM international conference on Information and knowledge management, (248-249)
  211. Hammouda K and Kamel M (2004). Efficient Phrase-Based Document Indexing for Web Document Clustering, IEEE Transactions on Knowledge and Data Engineering, 16:10, (1279-1296), Online publication date: 1-Oct-2004.
  212. Ma Q, Kanzaki K, Zhang Y, Murata M and Isahara H (2004). Self-organizing semantic maps and its application to word alignment in Japanese-Chinese parallel corpora, Neural Networks, 17:8-9, (1241-1253), Online publication date: 1-Oct-2004.
  213. Weeds J, Weir D and McCarthy D Characterising measures of lexical distributional similarity Proceedings of the 20th international conference on Computational Linguistics, (1015-es)
  214. Yager R (2004). A framework for multi-source data fusion, Information Sciences: an International Journal, 163:1-3, (175-200), Online publication date: 14-Jun-2004.
  215. ACM
    Harada M, Sato S and Kazama K Finding authoritative people from the web Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, (306-313)
  216. ACM
    Kraft R and Zien J Mining anchor text for query refinement Proceedings of the 13th international conference on World Wide Web, (666-674)
  217. ACM
    Novak J, Raghavan P and Tomkins A Anti-aliasing on the web Proceedings of the 13th international conference on World Wide Web, (30-39)
  218. Trajkova J and Gauch S Improving ontology-based user profiles Coupling approaches, coupling media and coupling languages for information retrieval, (380-390)
  219. Yamamoto A and Ogiso A Similarity of documents based on the vector sequence model Proceedings of the 2004 international conference on Intuitive Human Interfaces for Organizing and Accessing Intellectual Assets, (233-242)
  220. Billerbeck B and Zobel J Questioning query expansion Proceedings of the 15th Australasian database conference - Volume 27, (69-76)
  221. Liu F, Yu C and Meng W (2004). Personalized Web Search For Improving Retrieval Effectiveness, IEEE Transactions on Knowledge and Data Engineering, 16:1, (28-40), Online publication date: 1-Jan-2004.
  222. Bertoldi N and Federico M (2004). Statistical Models for Monolingual and Bilingual Information Retrieval, Information Retrieval, 7:1-2, (53-72), Online publication date: 1-Jan-2004.
  223. Zerfiridis K and Karatza H (2004). Brute force web search for wireless devices using mobile agents, Journal of Systems and Software, 69:1-2, (195-206), Online publication date: 1-Jan-2004.
  224. Gauch S, Chaffee J and Pretschner A (2003). Ontology-based personalized search and browsing, Web Intelligence and Agent Systems, 1:3-4, (219-234), Online publication date: 1-Dec-2003.
  225. ACM
    Westermann U and Klas W (2003). An analysis of XML database solutions for the management of MPEG-7 media descriptions, ACM Computing Surveys, 35:4, (331-373), Online publication date: 1-Dec-2003.
  226. ACM
    Waivio R A study in reading comprehension improvement Proceedings of the 2003 conference on Universal usability, (154-155)
  227. He H, Meng W, Yu C and Wu Z Wise-integrator Proceedings of the 29th international conference on Very large data bases - Volume 29, (357-368)
  228. Modha D and Spangler W (2003). Feature Weighting in k-Means Clustering, Machine Language, 52:3, (217-237), Online publication date: 1-Sep-2003.
  229. Lam C and Stork D Evaluating classifiers by means of test data with noisy labels Proceedings of the 18th international joint conference on Artificial intelligence, (513-518)
  230. Perrone M and Vinciarelli A Markov Model Document Retrieval Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
  231. Park G, Baek Y and Lee H Majority based ranking approach in web image retrieval Proceedings of the 2nd international conference on Image and video retrieval, (111-120)
  232. Weeds J and Weir D A general framework for distributional similarity Proceedings of the 2003 conference on Empirical methods in natural language processing, (81-88)
  233. Umemura Y Very low-dimensional latent semantic indexing for local query regions Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11, (84-91)
  234. ACM
    Nakov P Building an inflectional stemmer for Bulgarian Proceedings of the 4th international conference conference on Computer systems and technologies: e-Learning, (419-424)
  235. Park H, Jeon M and Rosen J (2003). Lower Dimensional Representation of Text Data Based on Centroids and Least Squares, BIT, 43:2, (427-448), Online publication date: 1-Jun-2003.
  236. Shimohata M, Sumita E and Matsumoto Y Retrieving meaning-equivalent sentences for example-based rough translation Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3, (50-56)
  237. ACM
    Lim L, Wang M, Padmanabhan S, Vitter J and Agarwal R Dynamic maintenance of web indexes using landmarks Proceedings of the 12th international conference on World Wide Web, (102-111)
  238. Yang L and Rahi A Dynamic clustering of web search results Proceedings of the 2003 international conference on Computational science and its applications: PartI, (153-159)
  239. Mladenić D and Grobelnik M (2003). Feature selection on hierarchy of web documents, Decision Support Systems, 35:1, (45-87), Online publication date: 1-Apr-2003.
  240. Crestani F (2003). Vocal Access to a Newspaper Archive, Journal of Intelligent Information Systems, 20:2, (161-180), Online publication date: 1-Mar-2003.
  241. ACM
    Kim H and Chan P Learning implicit user interest hierarchy for context in personalization Proceedings of the 8th international conference on Intelligent user interfaces, (101-108)
  242. Salton G and Harman D Information retrieval Encyclopedia of Computer Science, (858-863)
  243. Elovici Y, Shapira B and Kantor P (2003). Using the Information Structure Model to Compare Profile-Based Information Filtering Systems, Information Retrieval, 6:1, (75-97), Online publication date: 1-Jan-2003.
  244. Shieh W, Chen T, Shann J and Chung C (2003). Inverted file compression through document identifier reassignment, Information Processing and Management: an International Journal, 39:1, (117-131), Online publication date: 1-Jan-2003.
  245. ACM
    Elovici Y, Shapira B and Maschiach A A new privacy model for hiding group interests while accessing the Web Proceedings of the 2002 ACM workshop on Privacy in the Electronic Society, (63-70)
  246. ACM
    Liu F, Yu C and Meng W Personalized web search by mapping user queries to categories Proceedings of the eleventh international conference on Information and knowledge management, (558-565)
  247. ACM
    Hsu J, Chen A, Chen H and Liu N The effectiveness study of various music information retrieval approaches Proceedings of the eleventh international conference on Information and knowledge management, (422-429)
  248. Antoniol G, Canfora G, Casazza G, De Lucia A and Merlo E (2002). Recovering Traceability Links between Code and Documentation, IEEE Transactions on Software Engineering, 28:10, (970-983), Online publication date: 1-Oct-2002.
  249. Stuckenschmidt H Approximate Information Filtering on the Semantic Web Proceedings of the 25th Annual German Conference on AI: Advances in Artificial Intelligence, (114-128)
  250. Al-Sughaiyer I and Al-Kharashi I Rule Parser for Arabic Stemmer Proceedings of the 5th International Conference on Text, Speech and Dialogue, (11-18)
  251. Korhonen A and Krymolowski Y On the robustness of entropy-based similarity measures in evaluation of subcategorization acquisition systems proceedings of the 6th conference on Natural language learning - Volume 20, (1-7)
  252. Lee K, Kageura K and Choi K Implicit ambiguity resolution using incremental clustering in Korean-to-English cross-language information retrieval Proceedings of the 19th international conference on Computational linguistics - Volume 1, (1-7)
  253. Ma Q, Zhang M, Murata M, Zhou M and Isahara H Self-organizing Chinese and Japanese semantic maps Proceedings of the 19th international conference on Computational linguistics - Volume 1, (1-7)
  254. Alam H, Cheng H, Hartono R, Kumar A, Llido P, Nakatsu C, Nguyen H, Rahman F, Tarnikova Y, Tjahjadi T and Wilcox C Automatic semantic grouping in a spoken language user interface toolkit Proceedings of the 19th international conference on Computational linguistics - Volume 1, (1-7)
  255. ACM
    Wolin B Automatic classification in product catalogs Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, (351-352)
  256. ACM
    Federico M and Bertoldi N Statistical cross-language information retrieval using n-best query translations Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, (167-174)
  257. ACM
    Lin S and Ho J Discovering informative content blocks from Web documents Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, (588-593)
  258. ACM
    Tejada S, Knoblock C and Minton S Learning domain-independent string transformation weights for high accuracy object identification Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, (350-359)
  259. ACM
    Waivio R (2002). A study in reading comprehension improvement, ACM SIGCAPH Computers and the Physically Handicapped:73-74, (154-155), Online publication date: 17-Jun-2002.
  260. ACM
    Shasha D, Wang J and Giugno R Algorithmics and applications of tree and graph searching Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, (39-52)
  261. Lin S, Chen M, Ho J and Huang Y (2002). ACIRD, IEEE Transactions on Knowledge and Data Engineering, 14:3, (599-614), Online publication date: 1-May-2002.
  262. Crestani F (2002). Spoken query processing for interactive information retrieval, Data & Knowledge Engineering, 41:1, (105-124), Online publication date: 1-Apr-2002.
  263. ACM
    Martinez J and Loisant E Browsing image databases with Galois' lattices Proceedings of the 2002 ACM symposium on Applied computing, (791-795)
  264. ACM
    Kotsakis E Structured information retrieval in XML documents Proceedings of the 2002 ACM symposium on Applied computing, (663-667)
  265. Riggs K (2002). Exploring IR with UNIX tools, Journal of Computing Sciences in Colleges, 17:4, (179-194), Online publication date: 1-Mar-2002.
  266. Baeza-Yates R, Moffat A and Navarro G Searching large text collections Handbook of massive data sets, (195-243)
  267. Lu G (2001). Indexing and Retrieval of Audio, Multimedia Tools and Applications, 15:3, (269-290), Online publication date: 1-Dec-2001.
  268. ACM
    Rapela J Automatically combining ranking heuristics for HTML documents Proceedings of the 3rd international workshop on Web information and data management, (61-67)
  269. ACM
    Cacheda F and Viña A Superimposing codes representing hierarchical information in web directories Proceedings of the 3rd international workshop on Web information and data management, (54-60)
  270. ACM
    Krishna K and Krishnapuram R A clustering algorithm for asymmetrically related data with applications to text mining Proceedings of the tenth international conference on Information and knowledge management, (571-573)
  271. ACM
    Tsikrika T and Lalmas M Merging techniques for performing data fusion on the web Proceedings of the tenth international conference on Information and knowledge management, (127-134)
  272. ACM
    Wong K, Song D, Bruza P and Cheng C (2001). Application of aboutness to functional benchmarking in information retrieval, ACM Transactions on Information Systems, 19:4, (337-370), Online publication date: 1-Oct-2001.
  273. ACM
    Davidson A, Anvik J and Nascimento M Parallel traversal of signature trees for fast CBIR Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrieval, (6-9)
  274. Raghavan S and Garcia-Molina H Crawling the Hidden Web Proceedings of the 27th International Conference on Very Large Data Bases, (129-138)
  275. ACM
    Chávez E, Navarro G, Baeza-Yates R and Marroquín J (2001). Searching in metric spaces, ACM Computing Surveys, 33:3, (273-321), Online publication date: 1-Sep-2001.
  276. Hanani U, Shapira B and Shoval P (2001). Information Filtering, User Modeling and User-Adapted Interaction, 11:3, (203-259), Online publication date: 1-Aug-2001.
  277. ACM
    Vitter J (2001). External memory algorithms and data structures, ACM Computing Surveys, 33:2, (209-271), Online publication date: 1-Jun-2001.
  278. Racine K and Yang Q (2001). Redundancy Detection in Semistructured Case Bases, IEEE Transactions on Knowledge and Data Engineering, 13:3, (513-518), Online publication date: 1-May-2001.
  279. Hersh W (2001). Managing Gigabytes—Compressing and Indexing Documents and Images (Second Edition), Information Retrieval, 4:1, (79-80), Online publication date: 1-Apr-2001.
  280. Kantor P (2001). Foundations of Statistical Natural Language Processing, Information Retrieval, 4:1, (80-81), Online publication date: 1-Apr-2001.
  281. Laber E, Milidiú R and Pessoa A On binary searching with non-uniform costs Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms, (855-864)
  282. Dhillon I and Modha D (2001). Concept Decompositions for Large Sparse Text Data Using Clustering, Machine Language, 42:1-2, (143-175), Online publication date: 1-Jan-2001.
  283. ACM
    Geffet M and Feitelson D Hierarchical indexing and document matching in BoW Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries, (259-267)
  284. ACM
    Liu K, Meng W and Yu C Discovery of similarity computations of search engines Proceedings of the ninth international conference on Information and knowledge management, (290-297)
  285. ACM
    Chaffee J and Gauch S Personal ontologies for web navigation Proceedings of the ninth international conference on Information and knowledge management, (227-234)
  286. ACM
    Tai S, Ong C and Abullah N On designing an automated Malaysian stemmer for the Malay language (poster session) Proceedings of the fifth international workshop on on Information retrieval with Asian languages, (207-208)
  287. ACM
    Theeramunkong T, Sornlertlamvanich V, Tanhermhong T and Chinnan W Character cluster based Thai information retrieval Proceedings of the fifth international workshop on on Information retrieval with Asian languages, (75-80)
  288. ACM
    Tai X, Sasaki M, Tanaka Y and Kita K Improvement of vector space information retrieval model based on supervised learning Proceedings of the fifth international workshop on on Information retrieval with Asian languages, (69-74)
  289. ACM
    Tsay J and Wang J Improving automatic Chinese text categorization by error correction Proceedings of the fifth international workshop on on Information retrieval with Asian languages, (1-8)
  290. Holub M and Böhmová A Use of dependency tree structures for the microcontext extraction Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11, (23-33)
  291. Kawasaki S, Nguyen N and Ho T Hierarchical Document Clustering Based on Tolerance Rough Set Model Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, (458-463)
  292. ACM
    Gavrilov M, Anguelov D, Indyk P and Motwani R Mining the stock market (extended abstract) Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, (487-496)
  293. ACM
    Hatzivassiloglou V, Gravano L and Maganti A An investigation of linguistic features and clustering algorithms for topical document clustering Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, (224-231)
  294. ACM
    Silva I, Ribeiro-Neto B, Calado P, Moura E and Ziviani N Link-based and content-based evidential information in a belief network model Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, (96-103)
  295. ACM
    Kobayashi M and Takeda K (2000). Information retrieval on the web, ACM Computing Surveys, 32:2, (144-173), Online publication date: 1-Jun-2000.
  296. ACM
    Agichtein E and Gravano L Snowball Proceedings of the fifth ACM conference on Digital libraries, (85-94)
  297. ACM
    Oh J and Hua K (2000). Efficient and cost-effective techniques for browsing and indexing large video databases, ACM SIGMOD Record, 29:2, (415-426), Online publication date: 1-Jun-2000.
  298. ACM
    Modha D and Spangler W Clustering hypertext with applications to web searching Proceedings of the eleventh ACM on Hypertext and hypermedia, (143-152)
  299. ACM
    Oh J and Hua K Efficient and cost-effective techniques for browsing and indexing large video databases Proceedings of the 2000 ACM SIGMOD international conference on Management of data, (415-426)
  300. ACM
    Chen Z, Koudas N, Korn F and Muthukrishnan S Selectively estimation for Boolean queries Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, (216-225)
  301. Hughey M and Berry M (2000). Improved Query Matching Using kd-Trees: A Latent Semantic Indexing Enhancement, Information Retrieval, 2:4, (287-302), Online publication date: 1-May-2000.
  302. Menczer F and Belew R (2000). Adaptive Retrieval Agents, Machine Language, 39:2-3, (203-242), Online publication date: 1-May-2000.
  303. Yamashita T and Matsumoto Y Language independent morphological analysis Proceedings of the sixth conference on Applied natural language processing, (232-238)
  304. Melucci M and Orio N Smile Content-Based Multimedia Information Access - Volume 2, (1246-1260)
  305. D'Alessio S, Murray K, Schiaffino R and Kershenbaum A The effect of using hierarchical classifiers in text categorization Content-Based Multimedia Information Access - Volume 1, (302-313)
  306. Landoni M, Crestani F and Melucci M The Visual Book and the Hyper-TextBook Content-Based Multimedia Information Access - Volume 1, (247-265)
  307. Dunlop M and McDonald K Supporting different search strategies in a video query interface Content-Based Multimedia Information Access - Volume 1, (21-31)
  308. ACM
    Pereira F and Costa E The influence of learning in the behavior of information retrieval adaptive agents Proceedings of the 2000 ACM symposium on Applied computing - Volume 1, (452-457)
  309. Crestani F (2000). Exploiting the Similarity of Non-Matching Terms at Retrieval Time, Information Retrieval, 2:1, (27-47), Online publication date: 1-Feb-2000.
  310. ACM
    Chakrabarti S (2000). Data mining for hypertext, ACM SIGKDD Explorations Newsletter, 1:2, (1-11), Online publication date: 1-Jan-2000.
  311. Chakrabarti S, Dom B, Gibson D, Kumar R, Raghavan P, Rajagopalan S and Tomkins A (1999). Topic Distillation and Spectral Filtering, Artificial Intelligence Review, 13:5-6, (409-435), Online publication date: 1-Dec-1999.
  312. ACM
    Adar E, Karger D and Stein L Haystack Proceedings of the eighth international conference on Information and knowledge management, (413-422)
  313. ACM
    Hsu W and Lang S Classification algorithms for NETNEWS articles Proceedings of the eighth international conference on Information and knowledge management, (114-121)
  314. ACM
    Tosukhowong P, Andres F, Ono K, Dessaigne N, Martinez J, Mouaddib N and Schmidt D A flexible image search engine Proceedings of the seventh ACM international conference on Multimedia (Part 2), (87-90)
  315. Jiang J, Berry M, Donato J, Ostrouchov G and Grady N (1999). Mining consumer product data via latent semantic indexing, Intelligent Data Analysis, 3:5, (377-398), Online publication date: 1-Sep-1999.
  316. Hsu W and Lang S Feature Reduction and Database Maintenance in NETNEWS Classification Proceedings of the 1999 International Symposium on Database Engineering & Applications
  317. ACM
    Melucci M and Orio N Musical information retrieval using melodic surface Proceedings of the fourth ACM conference on Digital libraries, (152-160)
  318. ACM
    Tseng Y Content-based retrieval for music collections Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, (176-182)
  319. ACM
    de Kretser O and Moffat A Effective document presentation with a locality-based similarity heuristic Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, (113-120)
  320. ACM
    Aggarwal C, Wolf J and Yu P A new method for similarity indexing of market basket data Proceedings of the 1999 ACM SIGMOD international conference on Management of data, (407-418)
  321. ACM
    Aggarwal C, Wolf J and Yu P (1999). A new method for similarity indexing of market basket data, ACM SIGMOD Record, 28:2, (407-418), Online publication date: 1-Jun-1999.
  322. ACM
    Michail A and Notkin D Assessing software libraries by browsing similar classes, functions and relationships Proceedings of the 21st international conference on Software engineering, (463-472)
  323. ACM
    Kleinberg J and Tomkins A Applications of linear algebra in information retrieval and hypertext analysis Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, (185-193)
  324. ACM
    Indyk P Sublinear time algorithms for metric space problems Proceedings of the thirty-first annual ACM symposium on Theory of Computing, (428-434)
  325. Chow J, Cheng J, Chang D and Xu J Index Design for Structured Documents Based on Abstraction Proceedings of the Sixth International Conference on Database Systems for Advanced Applications, (89-96)
  326. Fragoudis D and Likothanassis S Retriever Proceedings of the 20th international conference on Information Systems, (422-427)
  327. Cole R, Hariharan R and Indyk P Tree pattern matching and subset matching in deterministic O(n log3 n)-time Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms, (245-254)
  328. Boley D (1998). Principal Direction Divisive Partitioning, Data Mining and Knowledge Discovery, 2:4, (325-344), Online publication date: 1-Dec-1998.
  329. ACM
    Xu M and Gauch S Associated biological information retrieval from distributed databases Proceedings of the seventh international conference on Information and knowledge management, (193-200)
  330. ACM
    de Lima L, Laender A and Ribeiro-Neto B A hierarchical approach to the automatic categorization of medical documents Proceedings of the seventh international conference on Information and knowledge management, (132-139)
  331. ACM
    Embley D, Campbell D, Smith R and Liddle S Ontology-based extraction and structuring of information from data-rich unstructured documents Proceedings of the seventh international conference on Information and knowledge management, (52-59)
  332. Wills G An Interactive View for Hierarchical Clustering Proceedings of the 1998 IEEE Symposium on Information Visualization
  333. Barry Crabtree I and Soltysiak S (1998). Identifying and tracking changing interests, International Journal on Digital Libraries, 2:1, (38-53), Online publication date: 1-Oct-1998.
  334. Fung P and Yee L An IR approach for translating new words from nonparallel, comparable texts Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1, (414-420)
  335. Chen K, Tsuei W and Chien L PAT-trees with the deletion function as the learning device for linguistic patterns Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1, (244-250)
  336. Lin D Automatic retrieval and clustering of similar words Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 2, (768-774)
  337. ACM
    Tseng Y Multilingual keyword extraction for term suggestion Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, (377-378)
  338. ACM
    Vo A and Moffat A Compressed inverted files with reduced decoding overheads Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, (290-297)
  339. ACM
    Lin S, Shih C, Chen M, Ho J, Ko M and Huang Y Extracting classification knowledge of Internet documents with mining term associations Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, (241-249)
  340. ACM
    Cazals F Effective nearest neighbors searching on the hyper-cube, with applications to molecular clustering Proceedings of the fourteenth annual symposium on Computational geometry, (222-230)
  341. ACM
    Chen J, Wong L and Zhang L (1998). A protein patent query system powered by Kleisli, ACM SIGMOD Record, 27:2, (593-595), Online publication date: 1-Jun-1998.
  342. ACM
    Chen J, Wong L and Zhang L A protein patent query system powered by Kleisli Proceedings of the 1998 ACM SIGMOD international conference on Management of data, (593-595)
  343. ACM
    Indyk P and Motwani R Approximate nearest neighbors Proceedings of the thirtieth annual ACM symposium on Theory of computing, (604-613)
  344. ACM
    Xu J, Cao Y, Lim E and Ng W Database selection techniques for routing bibliographic queries Proceedings of the third ACM conference on Digital libraries, (264-274)
  345. ACM
    Ribeiro-Neto B and Barbosa R Query performance for tightly coupled distributed digital libraries Proceedings of the third ACM conference on Digital libraries, (182-190)
  346. ACM
    Kanada Y Axis-specified search Proceedings of the third ACM conference on Digital libraries, (108-117)
  347. ACM
    Han E, Boley D, Gini M, Gross R, Hastings K, Karypis G, Kumar V, Mobasher B and Moore J WebACE Proceedings of the second international conference on Autonomous agents, (408-415)
  348. ACM
    Menczer F and Belew R Adaptive information agents in distributed textual environments Proceedings of the second international conference on Autonomous agents, (157-164)
  349. ACM
    Boone G Concept features in Re:Agent, an intelligent Email agent Proceedings of the second international conference on Autonomous agents, (141-148)
  350. Gudivada V (1998). TR$\Re$-String, IEEE Transactions on Knowledge and Data Engineering, 10:3, (504-512), Online publication date: 1-May-1998.
  351. ACM
    Ogawa Y and Matsuda T (1997). Overlapping statistical word indexing, ACM SIGIR Forum, 31:SI, (226-234), Online publication date: 2-Dec-1997.
  352. ACM
    Vélez B, Weiss R, Sheldon M and Gifford D (1997). Fast and effective query refinement, ACM SIGIR Forum, 31:SI, (6-15), Online publication date: 2-Dec-1997.
  353. Crestani F (1997). Application of Spreading Activation Techniques in InformationRetrieval, Artificial Intelligence Review, 11:6, (453-482), Online publication date: 1-Dec-1997.
  354. Goldin L and Berry D (1997). AbstFinder, A Prototype Natural Language Text Abstraction Finder for Use in Requirements Elicitation, Automated Software Engineering, 4:4, (375-412), Online publication date: 1-Oct-1997.
  355. Chakrabarti S, Dom B, Agrawal R and Raghavan P Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases Proceedings of the 23rd International Conference on Very Large Data Bases, (446-455)
  356. ACM
    Chang C and García-Molina H Evaluating the cost of Boolean query mapping Proceedings of the second ACM international conference on Digital libraries, (103-112)
  357. ACM
    Ogawa Y and Matsuda T Overlapping statistical word indexing Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval, (226-234)
  358. ACM
    Vélez B, Weiss R, Sheldon M and Gifford D Fast and effective query refinement Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval, (6-15)
  359. Chandrasekar R and Srinivas B Using syntactic information in document filtering Computer-Assisted Information Searching on Internet, (531-545)
  360. Wechsler M, Sheridan P and Schäuble P Multi-language text indexing for internet retrieval Computer-Assisted Information Searching on Internet, (217-232)
  361. ACM
    Charikar M, Chekuri C, Feder T and Motwani R Incremental clustering and dynamic information retrieval Proceedings of the twenty-ninth annual ACM symposium on Theory of computing, (626-635)
  362. ACM
    Das Neves F The Aleph Proceedings of the eighth ACM conference on Hypertext, (197-207)
  363. ACM
    Si A, Leong H and Lau R CHECK Proceedings of the 1997 ACM symposium on Applied computing, (70-77)
  364. Lee K, Lee Y and Berra P (1997). Management of Multi-structured Hypermedia Documents, Multimedia Tools and Applications, 4:2, (199-223), Online publication date: 1-Mar-1997.
  365. Oard D (1997). The State of the Art in Text Filtering, User Modeling and User-Adapted Interaction, 7:3, (141-178), Online publication date: 1-Mar-1997.
  366. Raghavan P Information retrieval algorithms Proceedings of the eighth annual ACM-SIAM symposium on Discrete algorithms, (11-18)
  367. Ogawa Y Effective & Efficient Document Ranking without using a Large Lexicon Proceedings of the 22th International Conference on Very Large Data Bases, (192-202)
  368. Chu W and Yang H A Formal Method to Software Integration in Reuse Proceedings of the 20th Conference on Computer Software and Applications
  369. Chang K, Garcia-Molina H and Paepcke A (1996). Boolean Query Mapping Across Heterogeneous Information Sources, IEEE Transactions on Knowledge and Data Engineering, 8:4, (515-521), Online publication date: 1-Aug-1996.
  370. ACM
    Lee Y, Yoo S, Yoon K and Berra P Index structures for structured documents Proceedings of the first ACM international conference on Digital libraries, (91-99)
  371. Smadja F, McKeown K and Hatzivassiloglou V (1996). Translating collocations for bilingual lexicons, Computational Linguistics, 22:1, (1-38), Online publication date: 1-Mar-1996.
  372. ACM
    Baeza-Yates R and Navarro G (1996). Integrating contents and structure in text retrieval, ACM SIGMOD Record, 25:1, (67-79), Online publication date: 1-Mar-1996.
  373. ACM
    Succi G, Baruchelli F and Ronchetti M A taxonomy for identifying a software component for uncertain and partial specifications Proceedings of the 1996 ACM symposium on Applied Computing, (570-579)
  374. ACM
    Baeza-Yates R (1995). Teaching algorithms, ACM SIGACT News, 26:4, (51-59), Online publication date: 1-Dec-1995.
  375. ACM
    Church K and Rau L (1995). Commercial applications of natural language processing, Communications of the ACM, 38:11, (71-79), Online publication date: 1-Nov-1995.
  376. ACM
    Riloff E Little words can make a big difference for text classification Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, (130-136)
  377. ACM
    Navarro G and Baeza-Yates R A language for queries on structure and contents of textual databases Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, (93-101)
  378. Persin M Document filtering for fast ranking Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, (339-348)
  379. Shoens K, Tomasic A and García-Molina H Synthetic workload performance analysis of incremental updates Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, (329-338)
  380. Aalbersberg I A document retrieval model based on term frequency ranks Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, (163-172)
  381. Frakes W and Pole T (1994). An Empirical Study of Representation Methods for Reusable Software Components, IEEE Transactions on Software Engineering, 20:8, (617-630), Online publication date: 1-Aug-1994.
  382. Quin L A text retrieval package for the unix operating system Proceedings of the USENIX Summer 1994 Technical Conference on USENIX Summer 1994 Technical Conference - Volume 1, (16-16)
  383. ACM
    Tomasic A, García-Molina H and Shoens K (1994). Incremental updates of inverted lists for text document retrieval, ACM SIGMOD Record, 23:2, (289-300), Online publication date: 1-Jun-1994.
  384. ACM
    Tomasic A, García-Molina H and Shoens K Incremental updates of inverted lists for text document retrieval Proceedings of the 1994 ACM SIGMOD international conference on Management of data, (289-300)
  385. Smadja F and McKeown K Translating collocations for use in bilingual lexicons Proceedings of the workshop on Human Language Technology, (152-156)
  386. ACM
    Perlman G Information retrieval techniques for hypertext in the semi-structured toolkit Proceedings of the fifth ACM conference on Hypertext, (260-267)
  387. Chinchor N and Sundheim B MUC-5 evaluation metrics Proceedings of the 5th conference on Message understanding, (69-78)
  388. ACM
    Fujii H and Croft W A comparison of indexing techniques for Japanese text retrieval Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, (237-246)
  389. ACM
    Tomasic A and Garcia-Molina H (1993). Caching and database scaling in distributed shared-nothing information retrieval systems, ACM SIGMOD Record, 22:2, (129-138), Online publication date: 1-Jun-1993.
  390. ACM
    Tomasic A and Garcia-Molina H Caching and database scaling in distributed shared-nothing information retrieval systems Proceedings of the 1993 ACM SIGMOD international conference on Management of data, (129-138)
  391. Church K and Mercer R (1993). Introduction to the special issue on computational linguistics using large corpora, Computational Linguistics, 19:1, (1-24), Online publication date: 1-Mar-1993.
  392. Chinchor N MUC-4 evaluation metrics Proceedings of the 4th conference on Message understanding, (22-29)
Contributors
  • Syracuse University
  • University of Chile

Index Terms

  1. Information retrieval: data structures and algorithms

      Recommendations

      Reviews

      Gerard Salton

      The area of text analysis, search, and retrieval has taken on increasing importance in recent years, and the field is now of interest to large communities in science and in the humanities. The need for a volume covering the major information retrieval algorithms has been apparent for many years, and the authors and editors of this book ought to be congratulated for devoting much time and effort to this important area. This book consists of separate chapters by some 20 different well-qualified authors, and it covers many of the more important information retrieval algorithms, including methods of file organization, file search and access, and query processing. The book has a practical outlook, and it should be of substantial help to people interested in information retrieval applications. Some of the chapters provide a reasonable overview of the areas they cover as well as a decent bibliography. Unhappily, as is true of many such collections of individual chapters, the overall effect appears in many ways to be less than the sum of its parts. First, the treatment of the field is noticeably uneven. Some subjects are covered only cursorily; for example, text analysis, portions of which are included in several different chapters, is not treated with any real insight. Some other subjects are emphasized far beyond their real importance—for example, a whole chapter is devoted to PAT trees, a data structure used in searching the Oxford English Dictionary . Digital search trees could have been covered more reasonably in two or three pages. The same is true of string searching, which is not applicable to the large information files that are now prevalent, although it is expertly covered in the chapter by Baeza-Yates. The book also unfortunately lacks glue between various related subjects, because the central agent that could have related different parts of the book is absent. The two short initial chapters that were obviously designed to provide this connection are not effective in this respect. An editorial decision that substantially contributes to the choppiness of the material is the use of major topic subdivisions entitled “File structure,” “Term and Query Operations,” and “Document Operations.” In practice, user queries are often available in the form of natural-language statements that are not immediately distinguishable from the stored document representations. In such circumstances, the distinct treatment of queries and documents produces unfortunate conceptual problems. Thus, relevance feedback—a query modification operation that occurs at the tail-end of the processing chain—is described in chapter 11, whereas some basic text indexing operations used at the beginning of the retrieval process are treated as document operations and appear in chapter 14. This discontinuity leads Donna Harman, the author of both of these chapters, to suggest that chapter 14 be read before chapter 11. For all these reasons, it is difficult to recommend this book to novices who many not be in a position to provide the needed context. Something more integrated, with a more compelling structured treatment of the field, might have been more useful. The current mix of some nice survey chapters, notably on signature files, string searching, relevance feedback, ranking, and clustering, with other more narrowly conceived topic treatments is difficult to manage even for people who know the field well. When a book is written by many different people with different outlooks and approaches, a careful editing job is essential. The treatment offered here leaves a lot to be desired. First, the book has many typos. The chapter heads are not evenly treated: sometimes a full address is given for an author as part of the chapter heading; other times only a company affiliation is given. Some authors use the third person in covering their subject; others prefer the first person plural, even when only a single author is involved. Most distressing is the lack of a decent index. The book has no author index, and no index of retrieval system acronyms. A subject index is included, but many important concepts in retrieval, such as classification, dictionary, knowledge base, hypertext, multimedia, and query formulation, do not appear in the index. Overall, this book fulfills a real need for the practitioner. It includes many nice chapters written by capable contributors. The volume could have been more useful, however, if more attention had been paid to the overall organization and if tighter editing and more careful production had prevailed.

      Access critical reviews of Computing literature here

      Become a reviewer for Computing Reviews.