How to cite data
There is no significant difference between citing datasets and citing publications. It is important to credit the author(s) and refer to the research dataset by including the DOI assigned to the dataset in the archive.
The dataset should be cited according to the applicable citation style in the text. It is often appropriate to include a reference in the methodology chapter, and the reference should be placed in the reference list.
Some may also choose to include the dataset reference in a data availability statement if the journal requires such a statement.
A reference to a dataset should include the following information:
- Author: Person(s) or organization that created or collected the data
- Title of the dataset
- Publication date
- Archive/Publisher
- Identifier: DOI or another persistent identifier for the dataset or individual files
- Version number
Citing and archiving your own data
By archiving the data on which your publications are based and citing these datasets, you create a link between your publication and the data, making it easier for others to locate your data. Read more about data archiving.
Recommendations and tools
Several data repositories provide recommendations on how to cite datasets.
Reference management tools like Zotero allow you to choose "dataset" as the material type, and the reference will then be formatted correctly based on the style you select. Read more about Zotero.