1. datasets
Fertility rate for women born in a given year
6. datasets
REASONING AUGMENTED RETRIEVAL (RAR) is the production-grade successor to single-pass RAG.
13. datasets
Need ideas for datasets (synthetic or real) in healthcare (Sharp + Fuzzy RD, Fixed Effects and DiD)
14. datasets
"Perfect silence" or "Noise" to focus?
15. datasets
Data cleaning/quality is very boring, right?
17. datasets
I built an open Hebrew Wikipedia Sentences Corpus: 11M sentences from 366K articles, cleaned and deduplicated
19. datasets
Knowledge graph datasets extracted from FTX collapse articles and Giuffre v. Maxwell depositions
23. datasets
Need “subdivision” for an address (MLS is unreliable, county sometimes missing). What dataset/API exists?
24. datasets
The dataset's still a potential marketplace?
25. datasets
Ranking the S&P 500 by C-level turnover
32. datasets
Using TRAC-1 or TRAC-2 for cyberbullying detection
33. datasets
[R] SNIC: Synthesized Noise Dataset in RAW + TIFF Formats (6000+ Images, 4 Sensors, 30 scenes)
34. datasets
Epstein Graph: 1.3M+ searchable documents from DOJ, House Oversight, and estate proceedings with AI entity extraction
36. datasets
Looking for a Phishing Dataset with .eml files
38. datasets
I/B/E/S needed for analyst coverage data
39. datasets
How to investigate performance issues in Spark?
40. datasets
Active Directory Vulnerability Datasets
41. datasets
Large dataset of real (non synthetic) video
42. datasets
Discord for data hackers and tinkerers
47. datasets
Men’s Mental Health (Adult Men) Research
50. datasets
S&P 500 Corporate Ethics Scores - 11 Dimensions
53. datasets
Final-year CS project: confused about how to construct a time-series dataset from network traffic (PCAP files)
54. datasets
[PAID] EU Amazon Product & Price Intelligence Dataset – 4M+ High-Value Products, Continuously Updated
59. datasets
Moltbook Dataset (Before Human and Bot spam)
60. datasets
Urgent help needed regarding a dataset!!!
63. datasets
Best resource for managing large datasets?
64. datasets
Platinum-CoT: High-Value Technical Reasoning. Distilled via Phi-4 → DeepSeek-R1 (70B) → Qwen 2.5 (32B) Pipeline
66. datasets
[NEW DATA] - Executive compensation dataset extracted from 100k+ SEC filings (2005-2022)
68. datasets
Analyzing Problems People face (school project)
73. datasets
Groundhog Day API: All historical predictions from all prognosticating groundhogs [self-promotion]
76. datasets
Le Refuge - Library Update / Real-world Human-AI interaction logs / [disclaimer] free AI resources.
79. datasets
Music Listening Data - Data from ~500k Users
83. datasets
Where to find traffic data for a specific road?
84. datasets
Lipid Nanoparticle Database (LNPDB): open-access structure-function dataset of ~20,000 lipid nanoparticles
88. datasets
Looking For Company 10-Ks and Financial Docs
90. datasets
Data center geolocation data in the US
91. datasets
Dataset for forecasting and time series
92. datasets
Precipitation datasets that you have used
94. datasets
Looking for a real pictures vs. AI-generated images dataset
95. datasets
From BIT TO SUBIT --- (Full Monograph)