Skip Navigation

Unofficial Reddit API

I am making a Unofficial Reddit API, which mimics the official one.

Its early days, but I would like to have a discussion here about it since my post was blocked on reddit(of course).

Let me know what you think of the project, if you have any input, let me know.

108 comments
  • Please don't take personal offense, but you have merely a project scaffold with an unrealistic goal that will be blocked and C&D'd into the ground, without any other projects created.

    It doesn't matter how hard you're working on your anonymity, this project will be ripped apart by a horde of lawyers in seconds. You're not only doing something questionable or against ToS, you're directly attacking and sabotaging their monetization. This will not be taken lightly by the legal team of reddit.

    You want to provide a better, cooler, more robust and other random buzzwords API than the own of reddit. So, you alone, want to provide a better API than the whole team of reddit does for their absolute core product, all by scraping. This is simply not realistic.

    While we're at the topic of monetization, scraping, ETL into your own model and providing the API - for the amount of content that reddit has (quantity, not quality) this will be a highly resource intensive task. How do you plan to fund that, since your API will be better than the official one, I can expect at least the same performance as well, right?

    And also, most importantly, even if you magically achieve working around all that and get that working - why? Who is your expected user group? Pretty much every software using reddit moved away from reddit or simply has died. AI gen content is rampant, and most discussions seem like bots talking to bots. There is literally nothing to gain from an API to reddit - so why would anyone bother using it?

  • Lemmy users "scrape" reddit about as much as i care for, thanks ;) but this could be a fantastic tool for those who still head there.

    Awesome

  • Is there a reason you're scraping data rather than attaching a network sniffer/reverse engineering the official apps and documenting the results? Or map the RSS feed to an API? The main thrust behind my comment is that I think scraping is pretty fragile, so I'm interested as to why other options are infeasible.

    • There's currently no implementation (the repos are currently just skeletons), so it could just be a semantics difference right now.

    • Wouldn't those other options be C&D'd?

      *I am a layman

    • Because we need to retain the breadth of functionality the API has, if you want to just scrape posts, APIs for that already exist, but i am aiming for something more.

      About reverse engineering, they can change that part at any time too, and may be even more fragile as they can change that without breaking the UX, if they change the front page CSS selectors or layout for example, it will effect the UX more as it changes the expected output, not the middle end that is just raw data.

      Thats my reasoning, I appreciate the input though (:

      • Making a breaking change to the mobile API also breaks old outdated installations of the app. Websites and their APIs are usually synced, apps not so.

        If they were really motivated to stop your method, they could just obfuscate the frontend with webpack and break your scraper every time they make an update.

  • Basically you want to write scraping solution specially for Reddit, it would be great if you started with scraping Frameworks like python scrapy framework

  • Now, if only to get their auto bots to stopping banning accounts for little to no reasons. If you disagree with the wrong mod or they don't like what to you say, they ban you.

    My 12yr old account got banned. I'm not worried about the link karma and comment karma.

108 comments