Working with bulk data

Contents
1. Download the public data file
2. Software to access the file
3. Process
4. Using the public file

ORCID shares a Public Data File at least annually on the anniversary of our initial launch in October 2012. Sharing the annual public data file is one of our key principles. We release the file to ensure that all stakeholders, including organizations that are not members of ORCID, have broad access to what we hope becomes a vital part of the scholarly communication infrastructure.

Use of data in the file is governed by the data file terms of use and our Privacy Policy (excerpt below):

ORCID is releasing the Public Data File under a CC0 1.0 Public Domain Dedication developed by Creative Commons. Accordingly, ORCID will not impose restrictions or conditions (including those contained in the Terms and Conditions of Use and the Membership Agreement) on use of the Public Data File, but has posted recommended community norms for use.

Download the public data file

Use the links below to download the annual public data file.

  • 2020 ORCID Public Data File
  • 2019 ORCID Public Data File
  • 2018 ORCID Public Data File
  • 2017 ORCID Public Data File
  • 2016 ORCID Public Data File
  • 2015 ORCID Public Data File
  • 2014 ORCID Public Data File
  • 2013 ORCID Public Data File

To the extent possible under law, ORCID, Inc. has waived all copyright and related or neighboring rights to the ORCID Public Data File. This work is published from: United States.

Software to access the file

  • Windows: A tool that can unpack tar files, such as WinRAR or 7-Zip
  • Mac: No additional software needed
  • XML-JSON conversion: From 2018, the file is provided only in XML format. Read on to learn how to generate JSON versions of the file.

Process

  1. Download the file from the links above. Each year’s data file may include multiple tar.gz files. For example, the 2018 file includes one tar.gz archive for all ORCID records and one tar.gz archive for all record activities.
  2. Windows: Use the tool that you downloaded (WinRAR or 7-Zip) to decompress the tar.gz file; this produces a single .tar file (which may appear with no extension). You may need to run the tool a second time on the .tar file to unpack it.
     Mac: Double-click the tar.gz file to decompress it; this produces a single .tar file (which may appear with no extension). Double-click the .tar file to unpack it.
  3. The layout of the unpacked folder depends on the year in which the file was generated and on the XSD version.
    1. 2013-2017: Inside the generated folder you will find multiple folders, for example json/ and xml/. Inside each folder is one file for each ORCID record in the specified format and XSD version.
    2. 2018+, records file: Inside the generated folder you will find one folder, summary/, which contains multiple subfolders of individual ORCID records in XML format. Records are grouped into subfolders by the final three digits of the ORCID iD, so all records whose iDs share those digits sit in the same subfolder.
    3. 2018+, activity file: Inside the generated folder you will find multiple folders for each ORCID record. Each folder will include the full activities on each ORCID record in XML format, separated by activity subsection.
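
As an alternative to the desktop tools above, the archive can also be read programmatically. The following is a minimal Python sketch, not an official ORCID tool: it streams a 2018+ records archive with the standard-library tarfile module and returns the XML for one iD without unpacking the whole file to disk. The helper names (shard_for, find_record) are hypothetical, and the sketch assumes that each record's file name contains its ORCID iD; only the grouping by final three digits comes from the description above.

```python
import tarfile
from pathlib import PurePosixPath


def shard_for(orcid_id: str) -> str:
    """Return the subfolder name for an iD: its final three characters."""
    return orcid_id.strip()[-3:]


def find_record(archive_path: str, orcid_id: str):
    """Return the raw XML bytes for orcid_id, or None if it is not found.

    Mode "r|gz" reads the tar.gz as a sequential stream, so members are
    inspected one by one and nothing is written to disk.
    """
    shard = shard_for(orcid_id)
    with tarfile.open(archive_path, mode="r|gz") as archive:
        for member in archive:
            if not member.isfile():
                continue
            path = PurePosixPath(member.name)
            # Assumption: the file name contains the iD and its parent
            # folder is named after the iD's final three characters.
            if path.parent.name == shard and orcid_id in path.name:
                return archive.extractfile(member).read()
    return None
```

A call such as find_record("records.tar.gz", "0000-0000-0000-0123") (both values are placeholders) scans the archive once from the start. For repeated lookups it is usually faster to unpack the archive as described in step 2 and open the file inside the matching subfolder directly.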

Using the public file

The file contains the public information associated with each user’s ORCID record. In the 2013-2017 files, each record is included as a separate file in both JSON and XML. From 2018 onward, each record is included as a separate file in XML only, and each full activity section for each record is also included as a separate XML file. If you prefer JSON, use the ORCID Conversion Library to convert the XML files to JSON.
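
For quick inspection of a single record, a generic XML-to-JSON conversion is sometimes enough. The sketch below uses only the Python standard library. It is not the ORCID Conversion Library and its output will not match that library's JSON format; the function names and the "@" and "#text" key conventions are choices made for this illustration.

```python
import json
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Drop the '{namespace}' prefix that ElementTree puts on names."""
    return tag.rsplit("}", 1)[-1]


def element_to_dict(element: ET.Element):
    """Convert an element into plain dicts, lists and strings."""
    children = list(element)
    text = (element.text or "").strip()
    if not children and not element.attrib:
        return text  # leaf element: keep only its text
    node = {}
    for key, value in element.attrib.items():
        node["@" + local_name(key)] = value
    if text:
        node["#text"] = text
    for child in children:
        key = local_name(child.tag)
        value = element_to_dict(child)
        if key in node:
            # Repeated sibling tags are collected into a list.
            if not isinstance(node[key], list):
                node[key] = [node[key]]
            node[key].append(value)
        else:
            node[key] = value
    return node


def xml_to_json(xml_bytes: bytes) -> str:
    """Parse one XML document and return it as indented JSON text."""
    root = ET.fromstring(xml_bytes)
    return json.dumps({local_name(root.tag): element_to_dict(root)}, indent=2)
```

Because namespace prefixes are discarded, two elements with the same local name in different namespaces would collide, so treat this as a convenience for exploring the data rather than a faithful conversion. Use the XSD to interpret what each element means.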

For the XSD required to interpret the files, see the ORCID GitHub repository.

To keep track of researchers and their activities on an ongoing basis, see Synchronizing with ORCID for ways to develop your integration.

The text of this website is published under a CC0 license. Images and marks are subject to copyright and trademark protection.
