David Smooke
Smooke
AI & ML interests
data, software, news, currency, cryptocurrency, software development, llms, internet usage, software market shares, startup investment data, startup location data, hackernoon
Recent Activity
posted an update about 15 hours ago
Before https://huggingface.co/chatgpt launched, ~5% of new web articles were AI-generated. By November 2024 that crossed 50%. By April 2025, 74% of new web pages contained AI-generated content.
What AI industrializes isn't bad content — it's plausible mediocrity. Grammatically correct, structurally coherent, superficially persuasive, and almost indistinguishable from average.
Researchers have already warned that AI-generated survey papers are flooding https://huggingface.co/arxiv-community — what was once a labor-intensive exercise in critical synthesis has become a low-barrier, high-volume output burying original work.
I wrote about why internet communities struggle to publish quality over quantity and what that means for every platform that doesn't actively resist it: https://hackernoon.com/why-internet-communities-struggle-to-publish-quality-over-quantity posted an update 4 months ago
New https://huggingface.co/HackerNoon Post: The Words of Interest Benchmark Test For Matching an LLM to Your Interests https://hackernoon.com/the-words-of-interest-benchmark-test-for-matching-an-llm-to-your-interests
By picking individual words instead phrases or paraphrases or passages, this test bypasses plot summaries (which are everywhere regurgitating themselves online) and focuses on the author's words. It reveals whether an AI has truly "absorbed" the specific texture of a book or is simply echoing the general internet consensus.