entertainment.theonion.com
robots.txt

Robots Exclusion Standard data for entertainment.theonion.com

Resource Scan

Scan Details

Site Domain entertainment.theonion.com
Base Domain theonion.com
Scan Status Ok
Last Scan 2024-04-25T11:39:11+00:00
Next Scan 2024-05-25T11:39:11+00:00

Last Scan

Scanned 2024-04-25T11:39:11+00:00
URL https://entertainment.theonion.com/robots.txt
Redirect https://www.theonion.com/robots.txt
Redirect Domain www.theonion.com
Redirect Base theonion.com
Domain IPs 151.101.130.166, 151.101.194.166, 151.101.2.166, 151.101.66.166
Redirect IPs 151.101.130.166, 151.101.194.166, 151.101.2.166, 151.101.66.166
Response IP 151.101.130.166
Found Yes
Hash 20bbc9e21a5f5eefe558ae56674ef8426f07680ce046d395772254a13b51036b
SimHash 09157a11ebb2

Groups

*

Rule Path
Disallow /stats/
Disallow /api/
Disallow /ajax/
Disallow /embed/
Disallow /setbucket*
Disallow /game/score/*
Disallow /game/summary/*
Disallow /search$
Disallow /search?

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
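
Example: checking a path against these groups

The sketch below shows one way the groups listed above could be evaluated for a given crawler and path. It is a simplified illustration, not the scanner's or the site's own logic. The names GROUPS, pattern_to_regex and is_allowed are hypothetical. It assumes the usual wildcard conventions for robots.txt paths ("*" matches any run of characters, a trailing "$" anchors the end of the path), selects a group by exact lowercase user-agent name with "*" as the fallback, and ignores Allow rules because none appear in this scan.

```python
import re

# Disallow rules transcribed from the groups in this scan.
# Keys are lowercase user-agent names; "*" is the default group.
GROUPS = {
    "*": [
        "/stats/", "/api/", "/ajax/", "/embed/", "/setbucket*",
        "/game/score/*", "/game/summary/*", "/search$", "/search?",
    ],
    "gptbot": ["/"],
    "ccbot": ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed conventions: '*' matches any run of characters and a
    trailing '$' anchors the end of the path. Everything else,
    including '?', is treated as a literal character.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return False if any Disallow rule in the selected group matches.

    Simplification: the group is chosen by exact lowercase name and
    falls back to '*'; real crawlers match on a product token.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start, so each rule acts as a path prefix.
    return not any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot", "/"),
        ("Googlebot", "/search"),
        ("Googlebot", "/search/results"),
        ("Googlebot", "/search?q=x"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, "/search$" blocks only the bare /search path and "/search?" blocks /search followed by a query string, while a longer path such as /search/results stays crawlable for agents in the default group.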

Other Records

Field Value
sitemap https://www.theonion.com/sitemap.xml