2. datasets
Fertility rate for women born in a given year
7. datasets
REASONING AUGMENTED RETRIEVAL (RAR) is the production-grade successor to single-pass RAG.
14. datasets
Need ideas for datasets (synthetic or real) in healthcare (Sharp + Fuzzy RD, Fixed Effects and DiD)
15. datasets
"Perfect silence" or "Noise" to focus ?
16. datasets
Data Clean/Quality is very boring right
18. datasets
I built an open Hebrew Wikipedia Sentences Corpus: 11M sentences from 366K articles, cleaned and deduplicated
20. datasets
Knowledge graph datasets extracted from FTX collapse articles and Giuffre v. Maxwell depositions
24. datasets
Need “subdivision” for an address (MLS is unreliable, county sometimes missing). What dataset/API exists?
25. datasets
The dataset's still a potential marketplace?
26. datasets
Ranking the S&P 500 by C-level turnover
33. datasets
Using TRAC-1 or TRAC-2 for cyberbullying detection
34. datasets
[R] SNIC: Synthesized Noise Dataset in RAW + TIFF Formats (6000+ Images, 4 Sensors, 30 scenes)
35. datasets
Epstein Graph: 1.3M+ searchable documents from DOJ, House Oversight, and estate proceedings with AI entity extraction
37. datasets
Looking for a Phishing Dataset with .eml files
39. datasets
I/B/E/S needed for analyst coverage data
40. datasets
How investigate performance issues in spark?
41. datasets
Active Directory Vulnerability Datasets
42. datasets
Large dataset of real (non synthetic) video
43. datasets
Discord for data hackers and tinkers
48. datasets
Men’s Mental Health (Adult Men) Research
51. datasets
S&P 500 Corporate Ethics Scores - 11 Dimensions
54. datasets
Final-year CS project: confused about how to construct a time-series dataset from network traffic (PCAP files)
55. datasets
[PAID] EU Amazon Product & Price Intelligence Dataset – 4M+ High-Value Products, Continuously Updated
60. datasets
Moltbook Dataset (Before Human and Bot spam)
61. datasets
Urgent help needed regarding a dataset!!!
64. datasets
Best resource for managing large datasets?
65. datasets
Platinum-CoT: High-Value Technical Reasoning. Distilled via Phi-4 → DeepSeek-R1 (70B) → Qwen 2.5 (32B) Pipeline
67. datasets
[NEW DATA] - Executive compensation dataset extracted from 100k+ SEC filings (2005-2022)
69. datasets
Analyzing Problems People face (school project)
74. datasets
Groundhog Day API: All historical predictions from all prognosticating groundhogs [self-promotion]
77. datasets
Le Refuge - Library Update / Real-world Human-AI interaction logs / [disclaimer] free AI-ressources.
80. datasets
Music Listening Data - Data from ~500k Users
84. datasets
Where to find traffic data for a specific road?
85. datasets
Lipid Nanoparticle Database (LNPDB): open-access structure-function dataset of ~20,000 lipid nanoparticles
89. datasets
Looking For Company 10-Ks and Financial Docs
91. datasets
Data center geolocation data in the US
92. datasets
dataset for forecasting and Time series
93. datasets
Precipitation datasets that you have used
95. datasets
Looking for a Real Pictures vs Ai Generated images
96. datasets
From BIT TO SUBIT --- (Full Monograph)