Diffbot 🤖 @diffbot
Never write another web scraper. Diffbot structures information from the web, so you don't have to.
diffbot.com · The Future · Joined September 2009
Tweets 2K
Followers 8K
Following 8K
Likes 10K
@dambuildshit Step 1: Build a free DNS provider Step 2: Collect tolls for access to sites behind their DNS Step 3: Build a crawler that doesn't need to pay their own toll Step 4: Profit
The web isn't a database. @diffbot makes it one. 10B+ entities and 1T facts extracted from 60B+ pages, rebuilt every 4-5 days. DuckDuckGo, Snapchat, and Dow Jones run on it. Massive powers the proxy infra behind their continuous crawl.
Ever wondered what your white name should have been? Introducing: whatismywhitename.com Upload a picture of you, and let the puppy guess your name! Let's test out nominative determinism 🫡 (Immigrants who named themselves will correlate more highly. Give us feedback plz)
Our thanks to:
- @modal for their generous credits toward training this meme model
- @diffbot for the clean, diverse dataset!
- @leannch86920 for the training research!
- Everyone NOT named David (biggest & noisiest dataset ever)
@devanshu_twt Sorry! It’s not ideal but it’s the easiest way to weed out 99% of abusers. When the product makes it easy to crawl the web, you get a lot of bad actors. Still thinking of a better way to solve this!
@groby Sorry for the late reply (and happy new years!) It's not on the immediate horizon, but implementing a credit balance model with a low minimum is something we've discussed. I personally prefer it. Would you mind emailing me at jerome[@]diffbot?
State of E-commerce Data Providers - Q4 2025
E-commerce runs on constant measurement: prices, promos, availability, seller changes, and "what the shelf actually looks like" across retailers and marketplaces. The challenge is stable collection at scale, retries when sites break, anti-bot evasion, clean geo signals, and then turning messy HTML into usable structured data. In preparation for the holiday season, we mapped the landscape of e-commerce data providers:
- Competitive intel + digital shelf: @dataweavein, @Price2Spy, @bigdataNODE, @Profitero, @WiserInc
- Marketplace intelligence + data: @junglescout, @H10Software, @datahawkco, @SellerSprite_EN
- Trade, Supply Chain, Imports / Exports: @Trademo1, @ImportYeti, @datamyne
- Scraper APIs & Extraction Platforms: @zytedata, @diffbot, @Stratalis, (AutoScraping handle?), @serpapi
- Managed Data Extraction & Services: @groupBWT, @Data_Ox, @epctex, @MrScraper_
- Retail Media & Ad Platforms: @Pacvue, @PerpetuaLabs, @Teikametrics
- Network & runtime infra for e-com scraping: @playwrightweb, Puppeteer, @browserless
@groby Wish granted. Will a $50 starting plan work?
YouTube, TikTok, Mastodon, & Threads are mostly there but need optimizing. Diffbot goes incredibly far with articles & that’s also moving along well. Reddit & Bluesky are readily available but I haven’t spent the time. X is finished but the endpoint gets rate limited 😞
I am in love with scraping using Shortcuts. I have about 8 sets of shortcuts for everyday social media sites that I'm developing in tandem. I'll be releasing them as I finish them – thread starts here.
Not Diffbot!
BREAKING: The Internet Massive outage being reported across platforms including Spotify, Google Cloud, AWS, Cloudflare, Claude, YouTube, Gmail, and many, many, more
San Diego developers, join us and our technical partners @neo4j, @Intuit, Eyepop.ai, @Replit , and @diffbot at our HackNight next week!
Check out the repo for more info: github.com/diffbot/diffbo…
89,886 developers are building their own Perplexity on-prem with Diffbot LLM — huggingface.co/diffbot/Llama-…
#Perplexity Sonar Pro API launched last week as the best performing model on factuality. 24 hours later, it's the 2nd best performing model (and it's not because of #DeepSeek). Why? 👇
Diffbot launches open-source AI model that achieves 81% accuracy by querying a trillion-fact Knowledge Graph in real-time instead of relying on static training data 🧠📊 Read more: venturebeat.com/ai/diffbots-ai… #ArtificialIntelligence #Enterprise #MachineLearning @diffbot
Gary Marcus @GaryMarcus
226K Followers 7K Following OG GenAI Skeptic; spoke at US Senate. Warned about hallucinations in 2001. Advocating world models & neurosymbolic AI ever since. Author, Marcus on AI & 6 books
Charly Wargnier @DataChaz
172K Followers 49K Following Ex @Streamlit @Snowflake Maestro • I write about AI agents, LLMs and automation • My ❤️ is open source • DM for collabs 📩
Evan Kirstel #B2B #Te... @EvanKirstel
378K Followers 311K Following TV host, Podcaster, Tech influencer, content creator, Industry Expert w/600K followers, focus on #Enterprise 💻 #Cloud ☁️#5G 📡#AI 🤖#Telecom ☎️ 🔑 #Cybersec
Bojan Tunguz @tunguz
286K Followers 8K Following Founder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Aaron Bradley @aaranged
11K Followers 954 Following Against fascism, so now https://t.co/eWVg6aRiWG. Still building your personal brand, doing marketing here? You won't care, but I have NO respect for you.
Glenn Gabe @glenngabe
83K Followers 8K Following SEO and AI Search Consultant at G-Squared Interactive focused on Google algorithm update recovery and AI Search visibility. Podcast: "SEO From The Front Lines".
Emma certified silly @demmav983
59 Followers 647 Following professional feeler & playlist sharer 🎵 follow back always
Abi @abigail_lajti
132 Followers 2K Following New Yorker in Berlin. Freelance Head of Growth 👩💻 Also building a granola company, like the food.
Andy Isaac @andyskale
3K Followers 9K Following Biz Dev @Skalenetwork💧| x402 enthusiast | Spacehost🎙️💙❤️
Nuwa @Nuwaonline
0 Followers 2K Following
d @d9381841867900
3 Followers 31 Following
Jeffrey @jeffreyjiao2025
0 Followers 22 Following
Benjamin Ramen @BenjaminRamen
0 Followers 21 Following
RyanΞHawks @ryanthawks
3K Followers 852 Following Excited about agentic AI, agent economies, and Ethereum.
TomatenPotaten 🍅 @TomatenPotaten
1 Followers 97 Following no time to think about what to put here right now
saloni @Saloni
1K Followers 1K Following Saloni (səloʊniː) = Twix+tech-loving consultant at McKinsey via Penn, MSFT and Kellogg. Heart firmly grounded in India. Not a South Indian film actress.
Dominus Tech @DominusTechTI
2K Followers 4K Following Revenda: #Adobe - #Arcserve - #Corel - #Mcafee - #Microsoft - #Oracle - #SAPBusinessOne - #Symantec - #Veritas - #Cisco - #HP - #HPE - #IBM - #Jujitsu - #Lenovo
Coco @ZCocooNTF
159 Followers 3K Following
ISAAC OJEDA GARCIA @ojeda_isaa21013
4 Followers 67 Following
Jason G @jdgellman
1 Followers 34 Following
carlos cifuentes @funkytamal
148 Followers 1K Following
LLM Relevance @llmrelevance
42 Followers 1K Following Curated tools to optimize your brand visibility in Google search and AI platforms, plus productivity tools to streamline your workflow. by @nicholaspatten
Pirlo丶 @gong_shuo
2 Followers 44 Following
musab mohamed @MMohamed59326
60 Followers 419 Following Entrepreneur | Domain Investor https://t.co/r1SBwib9zs https://t.co/T1oWs5dVns https://t.co/ysDIqCwkMJ https://t.co/bsABGLSnc4 https://t.co/jaibuai7SR https://t.co/o8p1gc8Pui https://t.co/m3eJk7WJpw https://t.co/lk0fg2dT7u
Peiman @PeimanS
0 Followers 962 Following
Abd ahmd @AbdullaAhammed8
293 Followers 2K Following Senior BDR @SkaleNetwork || Prev @FalconXGlobal @IBTxOfficial || Crypto Enthusiast ||
LiveGap Charts @LiveGapCharts
163 Followers 417 Following free online chart maker with real-time preview
Gyula Toth @GyulaToth8
21 Followers 773 Following
Victor Liu @VictorLOfficial
83 Followers 6K Following
On-Automate @4automate
0 Followers 3 Following Stocks📈 + Math🧮 = Your daily dose of market wisdom & mental gymnastics | Think different. Trade smarter.
Melvin Jackson @nt5ranger
603 Followers 8K Following Sr Systems Security Engine SC DAV,Army Vet. Airborne Ranger Sniper. CIB, Bgde TOC-Grid,Welcome to the Matrix! OSX & Sun Solaris Admin & MCSE DHS CISSP & CEH
Swen {👻,👻} @swencessi
6K Followers 5K Following
Contrebande @contrebande_co
11 Followers 287 Following Facteurs d'orgues à parfums. Bidouilleurs @Osmatique.
Matthew Cassinelli @mattcassinelli
17K Followers 10K Following Full-stack Siri developer. App Intents back-end consultant & Shortcuts front-end creator. Prev. team at Workflow before @Apple. Get my shortcuts 👇
Scuba 🤿 @scubadiving01
81 Followers 3K Following ˡᵒᵒᵏⁱⁿᵍ ᶠᵒʳʷᵃʳᵈ ᵗᵒ ᵗᵒᵐᵒʳʳᵒʷ. ᕙ(⇀‸↼‶)ᕗ God's favourite baby ☀︎༄.°
aero @aeroglade
0 Followers 768 Following
Oliver J. Scholten Ph... @ojscholten
72 Followers 236 Following Ex Hedge Fund Engineer | PhD Computer Scientist
Chandler T Wilson @chandlertwilson
1K Followers 2K Following Founder, bridge_ci - Machine Intelligence, OSINT and Alt Data for Finance, Geopolitics and Biz | Oxford Researcher of Complex Systems. MN Vikings fan.
AkameGaKill @hoktay07
43 Followers 806 Following
LindsayVan @9MC74Fg6Y2og1
42 Followers 2K Following
Marco Feiten @marcofeiten78
120 Followers 190 Following
Transhumanism, Futuri... @GreatArtDaily
15 Followers 2K Following
Luca Baggi @baggiponte
535 Followers 2K Following 📈 AI Engineer @ https://t.co/Du2lQ9AFgU 🗞 Ho scritto spiegoni @ilpost 🎓 MSc Econ & Stats @LaStatale 🎓 BA Filosofia @UniBergamo & @SorbonneParis1
HumanistAtypik @HumanistAtypik
427 Followers 4K Following Humaniste donc écosocialiste. Radical mais réformiste. Neuroatypique : #hpi #tdah #dysgraphique. Iconoclaste anticonformiste mais pas anti-tout ! ;-) #ric
Amin Sobor @infidelity707
24 Followers 591 Following ML Engineer & Full-stack Developer. Architecting Java Spring backends & premium UI/UX enhancements. Full-stack thinking, Machine Learning Intelligence.
Andi @andyeah20133
162 Followers 3K Following Machined rocket parts. Now I build tiny spaceships and light them like it's 1979. Miniatures · Experiments · Films · Machines
Gary Marcus @GaryMarcus
226K Followers 7K Following OG GenAI Skeptic; spoke at US Senate. Warned about hallucinations in 2001. Advocating world models & neurosymbolic AI ever since. Author, Marcus on AI & 6 books
Kirk Borne @KirkDBorne
486K Followers 3K Following Advisor to startups. Freelancer. @LeadershipData founder. Global Speaker. Top #B2B influencer & social promoter #DataScience #AI #ML. PhD Astrophysics @Caltech
Bojan Tunguz @tunguz
286K Followers 8K Following Founder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Aaron Bradley @aaranged
11K Followers 954 Following Against fascism, so now https://t.co/eWVg6aRiWG. Still building your personal brand, doing marketing here? You won't care, but I have NO respect for you.
swyx @swyx
163K Followers 4K Following achieve ambition with intentionality, intensity, integrity & insanity. affiliations: - @dxtipshq - @cognition - @temporalio - @aidotengineer - @latentspacepod
turbopuffer @turbopuffer
13K Followers 4 Following {vector, full-text} search engine built on object storage. fast, cheap, 1T scale. powers Anthropic, Cursor, Notion, and more
Simon Eskildsen @Sirupsen
18K Followers 832 Following co-founder & ceo @turbopuffer, former infra @Shopify 1k→1m RPS
Vivien Perrelle @PerrelleVivien
71 Followers 329 Following Building at the edge of AI, science, and trust. Currently building https://t.co/U3A0xtSnvK
Cheng Lou @_chenglou
69K Followers 449 Following Worked on: @reactjs, @messenger, @reasonml, @rescriptlang Currently: @midjourney, Pretext
Leann Chen @_leann_chen
93 Followers 228 Following
Rachel Blum @groby
2K Followers 1K Following I do things. Many things. I speak for nobody but myself, especially not for my employer. Former blue check. @[email protected] she/her. SJW. Queer.
Matthew Cassinelli @mattcassinelli
17K Followers 10K Following Full-stack Siri developer. App Intents back-end consultant & Shortcuts front-end creator. Prev. team at Workflow before @Apple. Get my shortcuts 👇
Chandler T Wilson @chandlertwilson
1K Followers 2K Following Founder, bridge_ci - Machine Intelligence, OSINT and Alt Data for Finance, Geopolitics and Biz | Oxford Researcher of Complex Systems. MN Vikings fan.
Juan Sequeda @juansequeda
6K Followers 942 Following Principal Strategist/Researcher @ServiceNow @HonestNoBSData podcast @UTCompSci PhD, 20+ years in Knowledge Graphs, Prev @datadotworld founder @Capsenta 🇺🇸🇨🇴
Alvin Chang @alv9n
7K Followers 994 Following Follow me here: https://t.co/xw1LwyaU1S Journalist / Assistant professor @TheNewSchool / https://t.co/jJ4Vf9PVsu
Eugene Yan @eugeneyan
27K Followers 661 Following MTS @AnthropicAI. Prev: Principal Applied Scientist @Amazon, led ML @ Alibaba, Lazada, Healthtech startup.
Adam DuVander @adamd
5K Followers 1K Following Author of Developer Marketing Does Not Exist. I help dev-focused marketers build a content strategy to reach more developers. Previously @zapier, @sendgrid
✌️Oᒪᗩᖴ KO... @Olaf_Kopp
5K Followers 649 Following Co-Founder @Aufgesang 🏠 Founder SEO Research Suite🤓 SEO+LLMO/GEO + Digital Brand Building+Customer Journey Management🔥based in 🇩🇪 & 🇵🇹
Carlos Ortega @carlos_darko
15K Followers 693 Following 🔸 Papá de Lluvia 🔸 Consultor SEO independiente 🔸 Co-director del Máster SEO de Big School 🎯 CONSULTORÍA SEO: https://t.co/edS174dHUM
Lolita Taub @lolitataub
93K Followers 844 Following Latina VC @ganasvc | $100K pre-seed/seed checks | 100+ startups backed
Fred Patton @FirstFredition
44 Followers 193 Following A highly creative generalist and polyglot programmer. #functionalprogramming #quantumcomputing #domaindrivendesign #startups #eventsourcing #arts #filmart #life
Stephen Diehl @smdiehl
54K Followers 3K Following Left this hellsite for BlueSky. https://t.co/jct5Pfs2NT
Andrew Lih @fuzheado
18K Followers 2K Following Wikimedian-at-large at @Smithsonian, @MetMuseum @Wikimedia strategist, author The @Wikipedia Revolution; Tooting at @[email protected]
RelationalAI @RelationalAI
1K Followers 112 Following RelationalAI brings enterprise decision intelligence to Snowflake's AI Data Cloud.
Yishan @yishan
106K Followers 528 Following I run Terraformation, and I was once the CEO of Reddit. Both are very interesting challenges. AMA in a subscriber-only newsletter! https://t.co/zA2F2S7etG
Torben Schulz @torbschulz
711 Followers 629 Following Founder @RowsHQ, husband, father of 3. Interested in spreadsheets, startups, foreign affairs, and learning guitar for dummies.
Jonathan Zittrain @zittrain
45K Followers 9K Following A small creature who likes to run around in universities. Prof. @Harvard_Law, @HSEAS, + @Kennedy_School; @EFF board mbr; director of @BKCHarvard and @HLSLib.
Philip Vollet @philipvollet
30K Followers 8K Following VP Developer Relations and Growth @weaviate_io & Open source lover
Jim Hendler @jahendler
7K Followers 801 Following An OK Boomer; old time AI guy; Semantic Web evangelist (still); Web Science promoter; and "geek who speaks wonk"(@[email protected])
Olaf Hartig @olafhartig
3K Followers 167 Following I have moved: https://t.co/DX27DzQEPO Senior Associate Professor in Computer Science at Linköping University, Amazon Scholar
DuckDuckGo @DuckDuckGo
2.7M Followers 5 Following Independent online privacy company with browser, search engine, and optional AI.
DataJournalism.com @datajournalism
40K Followers 1K Following Where journalism meets data: https://t.co/Dq6e7ta5ub is a space to read, watch, and discuss everything data. Brought to you by @ejcnet.
Dave Ojeda @daveojeda
1K Followers 789 Following Semantic Web Enthusiast, Knowledge Graph Geek & Structured Data Wrangler. Schema Markup, SEO & data is my day job and herding kids is my love! DM is open to all
fotos coloridas @SEOwebsemantica
2 Followers 23 Following
Guillermo Galdámez @GGxKM
240 Followers 586 Following KM geek. Adventurous eater. Going placidly among the noise & the haste. @McGillU & @TecdeMty alumnus. All opinions are my own.
MIT Technology Review @techreview
1.2M Followers 3K Following Our in-depth reporting on innovation reveals and explains what’s really happening now to help you know what’s coming next.
Sarah Wentworth @SarahEWentworth
65 Followers 297 Following My data passion is making sense of massive streams of digital data... streaming; data from the sublime transactional to the Tweetonomics of tomorrow.
Connected Data @Connected_Data
7K Followers 2K Following Connecting Data, People & Ideas since 2016. Using relationships, meaning, context in Data to achieve great things #KnowledgeGraph #GraphDB #AI #SemTech
David Amerland... @DavidAmerland
13K Followers 5K Following Time dilates for me. Contrib. @Forbes, @Inc. Represented by The Knight Agency. Latest book: "Built To Last". Occasionally political. Darebee Brand Ambassador.
Nicolas Torzec @nicolastorzec
2K Followers 12 Following Director of Research Engineering ML, AI and Knowledge Graphs for Web Search and E-Commerce Yahoo Labs | Kelkoo | Université de Rennes | France Telecom R&D
Paul Lopez @lopezunwired
2K Followers 939 Following Principal AI Architect at UnitedHealth Group (https://t.co/Vwfg1wBLNQ) • Modernizing Healthcare with AI *Personal Opinions*
Jake Ryan @TradecraftJake
6K Followers 2K Following Invest in #Bitcoin & #AutonomousOps CIO @TRADECRAFTc Author of @CRYPTODECRYPTD & @AgeOfAutonomy, '7 Best Books Crypto' @USNews & https://t.co/VA5vfh8kBX
Jeff Bauter Engel @JeffBauterEngel
2K Followers 565 Following Global Practice Editor @BainandCompany. Previously @xconomy @bizjournalmke @mnherald. @MarquetteU alum. Mitten State native. MSU Spartan fan.
Brian Dowling @be_d
1K Followers 972 Following boston courts and cases @blaw, prev. @law360 @bostonherald @hartfordcourant @columbiajourn @aquinascollege
Michael E. Docherty @medocherty
1K Followers 1K Following New venture creation at the intersection of big companies and startups. Author: Collective Disruption. https://t.co/E16CMB2fIO
GovLoop @GovLoop
24K Followers 4K Following GovLoop is the knowledge network for #government, connecting 300,000+ #federal, #state, and #local gov #innovators.
Lewis Shepherd @lewisshepherd
6K Followers 5K Following Work in DC + Palo Alto, play on the web. Specialist in advanced technologies for governments, and I consult/teach/write for fun.