LLMpediaThe first transparent, open encyclopedia generated by LLMs

CiteSeer

Generated by Llama 3.3-70B
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Google Scholar Hop 3
Expansion Funnel Raw 136 → Dedup 41 → NER 3 → Enqueued 2
1. Extracted136
2. After dedup41 (None)
3. After NER3 (None)
Rejected: 38 (parse: 38)
4. Enqueued2 (None)
Similarity rejected: 1
CiteSeer
NameCiteSeer

CiteSeer is a public search engine and digital library for scientific literature, particularly in the fields of computer science, information technology, and mathematics, developed by Pennsylvania State University. It was created to facilitate research and academic publishing by providing a comprehensive database of research papers and academic journals, including those from IEEE, ACM, and Springer Science+Business Media. CiteSeer was also designed to promote open access to scientific knowledge and to support the work of researchers at institutions such as Massachusetts Institute of Technology, Stanford University, and University of California, Berkeley. The development of CiteSeer was influenced by the work of Tim Berners-Lee, Vint Cerf, and Jon Postel, who played key roles in the creation of the Internet and the World Wide Web.

Introduction

CiteSeer is an automated citation indexing system that provides bibliographic information and abstracts for research articles and conference papers from a wide range of academic publishers, including Elsevier, Wiley-Blackwell, and Oxford University Press. The system uses natural language processing and machine learning techniques to extract metadata from PDF files and to identify citations and references within the text, similar to the approaches used by Google Scholar and Microsoft Academic. CiteSeer also provides links to full-text versions of the articles, when available, and allows users to search and browse the database using keywords, authors, and publications from institutions such as Harvard University, University of Oxford, and California Institute of Technology. The system has been used by researchers at NASA, European Organization for Nuclear Research, and National Institutes of Health to find and access relevant literature.

History

The development of CiteSeer began in the late 1990s at Pennsylvania State University, with the goal of creating a comprehensive database of computer science literature, including publications from ACM Special Interest Group on Management of Data and IEEE Computer Society. The system was initially called CiteSeer.IST and was launched in 1998, with a focus on indexing and searching research papers from conferences and journals such as SIGMOD, VLDB, and ICDE. Over time, the system expanded to include literature from other fields, such as mathematics and information technology, and was renamed CiteSeerX in 2008, with the support of National Science Foundation and Institute of Electrical and Electronics Engineers. CiteSeerX has since become a widely used resource for researchers and students at universities such as University of Cambridge, University of Tokyo, and École Polytechnique Fédérale de Lausanne.

Features

CiteSeer provides a range of features to support research and academic publishing, including automated citation indexing, full-text searching, and bibliographic information for research articles and conference papers from publishers such as Springer Nature and Wiley-VCH. The system also includes tools for authors to upload and manage their publications, as well as metrics for tracking citations and usage of research papers, similar to those used by ResearchGate and Academia.edu. CiteSeer also provides links to related publications and authors, and allows users to export bibliographic information in formats such as BibTeX and EndNote, which are also used by Mendeley and Zotero. The system has been used by researchers at institutions such as European Space Agency, Los Alamos National Laboratory, and University of Melbourne to find and access relevant literature.

Impact

CiteSeer has had a significant impact on the way researchers and students access and use scientific literature, particularly in the fields of computer science and information technology, with institutions such as Carnegie Mellon University, University of Edinburgh, and National University of Singapore using the system to support their research activities. The system has been widely adopted by universities and research institutions around the world, including University of California, Los Angeles, University of Illinois at Urbana-Champaign, and Georgia Institute of Technology, and has been recognized as a valuable resource for research and academic publishing by organizations such as Association for Computing Machinery and Institute of Electrical and Electronics Engineers. CiteSeer has also been used by researchers at NASA Jet Propulsion Laboratory, European Molecular Biology Laboratory, and National Center for Biotechnology Information to find and access relevant literature.

Technical Details

CiteSeer is built using a range of technologies, including Java, Python, and Apache Solr, and uses a database management system to store and manage the metadata and full-text content of the research papers and conference papers from publishers such as Elsevier, Wiley-Blackwell, and Oxford University Press. The system uses natural language processing and machine learning techniques to extract metadata from PDF files and to identify citations and references within the text, similar to the approaches used by Google Scholar and Microsoft Academic. CiteSeer also uses algorithms to rank and filter search results, and provides tools for users to customize their search experience, which are also used by ResearchGate and Academia.edu. The system has been used by researchers at institutions such as Stanford Linear Accelerator Center, Fermilab, and Brookhaven National Laboratory to find and access relevant literature.

Comparison to Other Services

CiteSeer is one of several digital libraries and search engines available for scientific literature, including Google Scholar, Microsoft Academic, and Semantic Scholar, which are also used by researchers at institutions such as Harvard University, University of Oxford, and California Institute of Technology. While these systems share some similarities with CiteSeer, they also have some key differences in terms of their features, coverage, and functionality, with Google Scholar providing a broader range of literature and Microsoft Academic providing more advanced search and filtering options. CiteSeer is unique in its focus on automated citation indexing and its ability to provide detailed bibliographic information and metrics for research papers and conference papers from publishers such as Springer Nature and Wiley-VCH. The system has been used by researchers at institutions such as European Organization for Nuclear Research, National Institutes of Health, and University of Melbourne to find and access relevant literature. Category:Digital libraries