Commit 51354d1

Initial Upload - Datasets
1 parent 3785f2d commit 51354d1

1 file changed

+75
-0
lines changed

Datasets.txt

Lines changed: 75 additions & 0 deletions
@@ -0,0 +1,75 @@
1+
Data.gov
2+
NOAA - https://www.ncdc.noaa.gov/cdo-web/
3+
atmospheric, ocean
4+
Bureau of Labor Statistics - https://www.bls.gov/data/
5+
employment, inflation
6+
US Census Data - https://www.census.gov/data.html
7+
demographics, income, geo, time series
8+
Bureau of Economic Analysis - http://www.bea.gov/data/gdp/gross-domestic-product
9+
GDP, corporate profits, savings rates
10+
Federal Reserve - https://fred.stlouisfed.org/
11+
currency, interest rates, payroll
12+
Quandl - https://www.quandl.com/
13+
financial and economic
14+
15+
Data.gov.uk
16+
UK Dataservice - https://www.ukdataservice.ac.uk
17+
Census data and much more
18+
WorldBank - https://datacatalog.worldbank.org
19+
census, demographics, geographic, health, income, GDP
20+
IMF - https://www.imf.org/en/Data
21+
economic, currency, finance, commodities, time series
22+
OpenData.go.ke
23+
Kenya government data on agriculture, education, water, health, finance, …
24+
https://data.world/
25+
Open Data for Africa - http://dataportal.opendataforafrica.org/
26+
agriculture, energy, environment, industry, …
27+
Kaggle - https://www.kaggle.com/datasets
28+
A huge variety of different datasets
29+
Amazon Reviews - https://snap.stanford.edu/data/web-Amazon.html
30+
35M product reviews from 6.6M users
31+
GroupLens - https://grouplens.org/datasets/movielens/
32+
20M movie ratings
33+
Yelp Reviews - https://www.yelp.com/dataset
34+
6.7M reviews, pictures, businesses
35+
IMDB Reviews - http://ai.stanford.edu/~amaas/data/sentiment/
36+
50k movie reviews (25k training, 25k test)
37+
Twitter Sentiment 140 - http://help.sentiment140.com/for-students/
38+
1.6M tweets
39+
Airbnb - http://insideairbnb.com/get-the-data.html
40+
A TON of data by geo
41+
UCI ML Datasets - http://mlr.cs.umass.edu/ml/
42+
iris, wine, abalone, heart disease, poker hands, …
43+
Enron Email dataset - http://www.cs.cmu.edu/~enron/
44+
500k emails from 150 people
45+
From the 2001 Enron energy scandal. See the documentary: Enron: The Smartest Guys in the Room.
46+
Spambase - https://archive.ics.uci.edu/ml/datasets/Spambase
47+
Emails
48+
Jeopardy Questions - https://www.reddit.com/r/datasets/comments/1uyd0t/200000_jeopardy_questions_in_a_json_file/
49+
200k questions and answers in JSON
50+
Gutenberg Ebooks - http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
51+
Large collection of books
52+
53+
IMAGES
54+
ImageNet - http://image-net.org
55+
14M images of objects
56+
Google Open Images - https://ai.googleblog.com/2016/09/introducing-open-images-dataset.html
57+
9M image URLs with labels
58+
Microsoft COCO - http://cocodataset.org
59+
330k images, most labeled
60+
Labeled Faces in the Wild - http://vis-www.cs.umass.edu/lfw/
61+
13k face images with names
62+
Stanford Dogs - http://vision.stanford.edu/aditya86/ImageNetDogs/
63+
120 dog breeds, 20k images
64+
65+
AUTONOMOUS CARS
66+
Berkeley DeepDrive - https://bdd-data.berkeley.edu/
67+
Massive dataset including 100k videos with 1100 hours of HD driving footage
68+
Belgian Traffic Signs - http://www.vision.ee.ethz.ch/~timofter/traffic_signs/
69+
10k images
70+
Bosch Small Traffic Signals - https://hci.iwr.uni-heidelberg.de/node/6132
71+
5k training and 8k test images
72+
WPI Traffic Light, Pedestrian, Lane-Keeping - http://computing.wpi.edu/dataset.html
73+
30GB of training and test data from Worcester, Mass
74+
UCSD LISA - http://cvrr.ucsd.edu/LISA/datasets.html
75+
Vehicle detection, traffic signals
