LLMpediaThe first transparent, open encyclopedia generated by LLMs

Wharton Research Data Services

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: MSCI World Hop 5
Expansion Funnel Raw 105 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted105
2. After dedup0 (None)
3. After NER0 ()
4. Enqueued0 ()
Wharton Research Data Services
NameWharton Research Data Services
TypeResearch data service
Founded1920s origins (Wharton School data activities)
OwnerUniversity of Pennsylvania
HeadquartersPhiladelphia, Pennsylvania
Website(not displayed)

Wharton Research Data Services is a commercial research data platform operated by the University of Pennsylvania's Wharton School that aggregates historical and contemporary datasets used in empirical studies in finance, accounting, management, and economics. The service integrates licensed databases, proprietary extracts, and curated datasets to support faculty at institutions such as the University of California, Berkeley, Harvard University, Stanford University, Massachusetts Institute of Technology, and London School of Economics. It serves researchers affiliated with institutions including Columbia University, Yale University, Princeton University, University of Chicago, and New York University.

Overview

Wharton Research Data Services operates as a centralized access point for datasets from major vendors like CRSP, Compustat, Thomson Reuters, Refinitiv, and Bureau van Dijk, while coordinating licensing relationships with organizations such as S&P Global, Moody's, Bloomberg L.P., and FactSet Research Systems. Academic clients include departments at Northwestern University, Duke University, University of Michigan, University of Pennsylvania, and Cornell University, and the platform supports research informing journals such as the Journal of Finance, American Economic Review, Quarterly Journal of Economics, Review of Financial Studies, and Journal of Accounting Research. Administratively it interacts with institutional offices like the Office of Research Administration (ORA) at various campuses and with consortia such as the Association of American Universities.

Data Products and Coverage

The product set bundles time series and panel data from vendors including Center for Research in Security Prices (CRSP), Compustat North America, Compustat Global, CRSP/Compustat Merged Database, and macroeconomic series from sources like the Federal Reserve Bank of St. Louis's FRED. Coverage spans equity prices, corporate fundamentals, mutual fund returns from Lipper, bond indices from ICE Data Services, and analyst estimates from I/B/E/S. Additional content includes patent data from the United States Patent and Trademark Office, merger records aligned with Securities and Exchange Commission filings, and governance indicators drawn from datasets maintained by Institutional Shareholder Services and Glass Lewis. Sector and industry taxonomies reference standards from North American Industry Classification System (NAICS) and Global Industry Classification Standard (GICS).

Access, Licensing, and Subscription Models

Access models are governed by licensing agreements often negotiated between publishers such as S&P Global Market Intelligence and subscribing institutions like California Institute of Technology and Johns Hopkins University. Subscription tiers vary by campus unit—libraries at institutions like University of Washington often manage campus-wide licenses while individual departments at Boston University or Brown University may maintain restricted access. Licenses incorporate terms from vendors including Refinitiv, Moody's Analytics, Bureau van Dijk, and Thomson Financial, and must comply with institutional policies used by offices such as Office of Sponsored Programs and Legal Counsel at universities. Authentication methods include campus federated login via Shibboleth, single sign-on integrations with CAS (Central Authentication Service), and IP-range access controlled by libraries like the New York Public Library.

Research and Academic Use Cases

Researchers at institutions including MIT Sloan School of Management, Columbia Business School, Yale School of Management, and Kellogg School of Management use the service for empirical studies published in outlets such as Management Science, Strategic Management Journal, and Journal of Financial Economics. Use cases include asset pricing research referencing work by scholars at University of Chicago Booth School of Business and Harvard Business School, corporate finance analyses consistent with methodologies from NBER, event-study designs using datasets cataloged by Cronin-style repositories, and cross-country studies leveraging indicators from World Bank and International Monetary Fund. Graduate programs at London Business School and INSEAD also use the platform for classroom exercises and dissertation projects.

Technical Infrastructure and Data Delivery

The platform delivers data through secure file transfer protocols and web-based query tools integrated with statistical packages such as StataCorp LLC, R Project for Statistical Computing, Python (programming language), and SAS Institute. Backend operations leverage cloud and on-premises resources similar to deployments from Amazon Web Services, Microsoft Azure, and Google Cloud Platform, while institutional network integrations reference services like Internet2. Data normalization and metadata practices align with standards promulgated by organizations such as Data Documentation Initiative and the Open Archives Initiative, and delivery formats include CSV, fixed-width, and relational schemas compatible with PostgreSQL, MySQL, and SQLite.

History and Development

Origins trace to archival data efforts at the Wharton School and collaborations with entities like Center for Research in Security Prices and Compustat during the late 20th century, with formalization into a hosted service influenced by trends at institutions such as University of California, Los Angeles and Stanford University. Over time it expanded through vendor partnerships with firms including S&P Global, Thomson Reuters, and Bureau van Dijk, and through service models adopted by academic data centers at Harvard University Library and Yale University Library. Governance has involved university units such as Provost Office and library consortia like OCLC.

Criticisms and Limitations

Critiques from faculty at institutions such as University of Chicago, Princeton University, and Columbia University focus on high subscription costs comparable to licensing disputes involving Elsevier and concerns about vendor lock-in similar to debates around ProQuest and JSTOR. Other limitations cited in forums among researchers at European University Institute and Australian National University include lag in data updates when compared to direct feeds from Bloomberg L.P. or Refinitiv, constraints imposed by vendor licensing like those from IHS Markit, and technical issues with integrations noted by IT teams at University of Illinois Urbana–Champaign. Calls for open access alternatives reference initiatives led by Open Data Institute and repositories such as Harvard Dataverse.

Category:University of Pennsylvania