bring back Facebook by scraping m.facebook.com

https://m.facebook.com/ is a little-known "lite" version of Facebook's full webapp with no JS and fairly simple HTML. it requires login, specifically `c_user` and `xs` cookies, but it's eminently scrapeable. https://facebook-atom.appspot.com/ already scrapes it to generate Atom feeds. apart from how distasteful it is to scrape with login cookies, we could scrape it like Instagram to bring back Facebook backfeed!

...sadly, FB's blocking is better than IG's. i actually implemented the scraping and extracted posts, comments, and likes/reactions, but i haven't been able to fetch users' timelines consistently. after one or two requests, FB consistently starts redirecting requests to `/login.php`, even with _all_ cookies that m.facebook.com gives me, fully spoofed User-Agent, and fetching from the same IP I logged in from. maybe browser fingerprinting? got me. this is where i stop digging. scraping, ugh.

related:
* https://github.com/rugantio/fbcrawl, a more fleshed out project that heavily scrapes https://mbasic.facebook.com/ (an alias for https://m.facebook.com/ afaict)
* https://github.com/snarfed/bridgy/issues/826#issuecomment-394136631, earlier nod toward this idea when bridgy FB originally died

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bring back Facebook by scraping m.facebook.com #886

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

bring back Facebook by scraping m.facebook.com #886

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions