Consider the time period -- current or historical, and the format -- table to print or data to download to a spreadsheet.
Integrated Public Use Microdata Series (IPUMS) is the world's largest individual-
Data page: ATUS Data Files - https://www.bls.gov/tus/#data
select single or multi year files or use the module data files
How to use ATUS microdata files
Home page ; http://www.bls.gov/nls/
Navigator page; register to use: https://www.nlsinfo.org/investigator/pages/login.jsp
The Investigator User's Guide describes how to use this website.
An available tutorial also teaches how to search for variables in the Investigator
Data page: https://nccs-data.urban.org/index.php
Data page:https://insideairbnb.com/get-the-data.html
Offers different csv data sets for listing in cities around the world
Data page: https://www.kaggle.com/c/walmart-recruiting-store-sales-forecasting/data
Historic sales data (csv files) for 45 stores located in different regions across the United States.
Data page: https://www.yelp.com/dataset/challenge
Yelp maintains a dataset for use in personal, educational, and academic purposes. It includes 6 million reviews spanning 189,000 businesses in 10 metropolitan areas. Students are welcome to participate.
Need to sign a form to get data first so I am not sure about file format
Data set page :https://data.cdc.gov/browse
By topic, sets can differ mostly zip files to download
Data set page: https://archive.ics.uci.edu/ml/datasets/Dow+Jones+Index
Predicting stock prices is a major application of data analysis and machine learning; zip file to download
Data page: WDI https://data.worldbank.org/data-catalog/world-development-indicators
Microdata Library https://microdata.worldbank.org/index.php/home
From the World Bank which is a source of financial and technical assistance to developing countries globally.
The organization is not an ordinary bank but rather a partnership to reduce poverty and support development.
Time Series - same phenomina over specificed length of time
Cross Section - 1 point of time in multiple places
Pooled - combination of time series and cross section data
Point readers to raw data by providing a Web address
- use "Retrieved from"
or a general place that houses data sets on the site
- use "Available from"
example:
United States Department of Housing and Urban Development. (2008). Indiana income limits [Data file]. Retrieved from https://www.huduser.org/Datasets/
IL/IL08/in_fy2008.pdf