trashwiki.org
robots.txt

Robots Exclusion Standard data for trashwiki.org

Resource Scan

Scan Details

Site Domain trashwiki.org
Base Domain trashwiki.org
Scan Status Ok
Last Scan 2024-06-11T02:31:01+00:00
Next Scan 2024-06-18T02:31:01+00:00

Last Scan

Scanned 2024-06-11T02:31:01+00:00
URL https://trashwiki.org/robots.txt
Domain IPs 104.21.13.51, 172.67.132.155, 2606:4700:3031::ac43:849b, 2606:4700:3037::6815:d33
Response IP 172.67.132.155
Found Yes
Hash 4632e376e767106d1f6db3a2f3cdc4210a22582ffb95e0396cd67ecb66fa3f4f
SimHash 1161c13a4f13

Groups

*

Rule Path
Disallow /
Allow /en/Banana
Allow /en/Diver%27s_etiquette
Allow /en/Dumpster_diver
Allow /en/Fruit_juice_steamer
Allow /en/Furniture
Allow /en/Germany
Allow /en/How_to_prevent_dumpster_diving
Allow /en/Literature
Allow /en/Main_Page
Allow /en/Making_money
Allow /en/Money
Allow /en/Skipper
Allow /en/Trashwiki
Allow /en/Trashwiki.org%3AAbout
Allow /en/User%3ARobino

ia_archiver-web.archive.org

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

archive.org_bot

Rule Path
Disallow
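
The groups above can be read as follows: the `*` group blocks everything except a short list of allowed pages, while the three Internet Archive groups each carry a single empty Disallow, which blocks nothing. Below is a minimal sketch of how a crawler might evaluate these rules, assuming the longest-match precedence described in RFC 9309 (the most specific rule wins, Allow wins ties). The names `STAR_GROUP`, `ARCHIVE_GROUP`, and `is_allowed` are hypothetical, only a subset of the Allow paths is copied, wildcards (`*`, `$`) are not handled because none appear in the scan, and percent-encoding is normalized simply by decoding both sides.

```python
from urllib.parse import unquote

# Rules for the "*" group, copied from the scan (subset of the Allow paths).
STAR_GROUP = [
    ("disallow", "/"),
    ("allow", "/en/Banana"),
    ("allow", "/en/Diver%27s_etiquette"),
    ("allow", "/en/Main_Page"),
]

# ia_archiver-web.archive.org, ia_archiver, and archive.org_bot each list
# one Disallow with an empty path.
ARCHIVE_GROUP = [("disallow", "")]


def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    Assumed semantics: prefix matching, the longest matching rule wins,
    Allow wins a tie, and a path with no matching rule is allowed.
    """
    target = unquote(path)
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        prefix = unquote(pattern)
        if not target.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed(STAR_GROUP, "/en/Banana")` is true and `is_allowed(STAR_GROUP, "/en/Apple")` is false, while `is_allowed(ARCHIVE_GROUP, path)` is true for any path. Note that Python's standard `urllib.robotparser` applies rules in file order (first match wins), so with `Disallow /` listed first it would not honor the Allow lines; that is why this sketch implements the matching by hand.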