amp.reddit.com
robots.txt

Robots Exclusion Standard data for amp.reddit.com

Resource Scan

Scan Details

Site Domain amp.reddit.com
Base Domain reddit.com
Scan Status Ok
Last Scan2024-05-19T19:12:18+00:00
Next Scan 2024-06-02T19:12:18+00:00

Last Scan

Scanned2024-05-19T19:12:18+00:00
URL https://amp.reddit.com/robots.txt
Redirect https://www.reddit.com/robots.txt
Redirect Domain www.reddit.com
Redirect Base reddit.com
Domain IPs 151.101.1.140, 151.101.129.140, 151.101.193.140, 151.101.65.140
Redirect IPs 151.101.1.140, 151.101.129.140, 151.101.193.140, 151.101.65.140
Response IP 151.101.109.140
Found Yes
Hash d32110c3945e957c19f64aa3366d5cd09259a413b31c09a20a9bdb4db6638629
SimHash 0417ebca6593

Groups

voltron

Rule Path
Disallow /

bender

Rule Path
Disallow /my_shiny_metal_ass

gort

Rule Path
Disallow /earth

mj12bot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

*

Rule Path
Disallow /*.json
Disallow /*.json-compact
Disallow /*.json-html
Disallow /*.xml
Disallow /*.rss
Allow /r/*.rss
Disallow /r/*/search.rss
Disallow /r/*/comments/*.rss
Disallow /r/*/config/*.rss
Disallow /r/*/wiki/*.rss
Disallow /*.i
Disallow /*.embed
Disallow /*/comments/*?*sort=
Disallow */comment/*
Allow /r/*/comments/*/*/de/*
Allow /r/*/comments/*/*/es/*
Allow /r/*/comments/*/*/fr/*
Allow /r/*/comments/*/*/pt/*
Allow /r/*/comments/*/*/it/*
Disallow /r/*/comments/*/*/*/*
Disallow /r/*/submit$
Disallow /r/*/submit/$
Disallow /message/compose*
Disallow /api
Disallow /post
Disallow /submit
Disallow /goto
Disallow /*before%3D
Disallow /domain/*t%3D
Disallow /login
Disallow /remove_email/t2_*
Disallow /r/*/user/
Disallow /gold?
Disallow /search$
Disallow /search?q=
Disallow /search/
Disallow /*/search?
Disallow /*/search/?
Disallow /*/search$
Disallow /*/search/$
Disallow /search.compact$
Disallow /*/search.compact$
Allow /r/*/comments/*/search/$
Allow /r/*/comments/*/search$
Disallow /static/button/button1.js
Disallow /static/button/button1.html
Disallow /static/button/button2.html
Disallow /static/button/button3.html
Disallow /subreddits/*
Disallow /buttonlite.js
Disallow /timings/perf
Disallow /counters/client-screenview
Disallow /*?*feed=
Disallow /svc/shreddit/*
Disallow /svc/sh/*
Disallow /svc/web/*
Disallow /graphql
Disallow /errors$
Disallow /live/*
Disallow /mediaembed/*
Disallow /media
Allow /
Allow /sitemaps/*.xml
Allow /posts/*

Comments

  • Our robots.txt is for search engines
  • 80legs
  • 80legs' new crawler

Warnings

  • 2 invalid lines.