oldgoogle.neocities.org
robots.txt

Robots Exclusion Standard data for oldgoogle.neocities.org

Resource Scan

Scan Details

Site Domain oldgoogle.neocities.org
Base Domain neocities.org
Scan Status Ok
Last Scan2025-06-01T15:46:06+00:00
Next Scan 2025-07-01T15:46:06+00:00

Last Scan

Scanned2025-06-01T15:46:06+00:00
URL https://oldgoogle.neocities.org/robots.txt
Domain IPs 198.51.233.2, 2620:2:6000::a:1
Response IP 198.51.233.2
Found Yes
Hash f0e705653a2db1c9dfea19aa383ec0eb751e7b1c8a21b820d654fdaa59d08e38
SimHash b416c6e0a239

Groups

*

Rule Path
Disallow /Legacy/*
Disallow /news/*
Disallow /extern_js/*
Disallow /1998/*
Disallow /2009/*
Disallow /2010/*
Disallow /2011/*
Disallow /2013/*
Disallow /2015/*
Disallow /1998/
Disallow /2009/
Disallow /2010/
Disallow /2011/
Disallow /2013/
Disallow /2015/
Disallow /intl/en_ALL/images/*
Disallow /intl/en_ALL/*
Disallow /config/*
Disallow /more/*
Disallow /2009/webhp/
Disallow /2009/webhp/?
Disallow /2009/webhp
Disallow /2010/webhp/
Disallow /2010/webhp/?
Disallow /2010/webhp
Disallow /2011/webhp/
Disallow /2011/webhp/?
Disallow /2011/webhp
Disallow /2013/webhp/
Disallow /2013/webhp/?
Disallow /2013/webhp
Disallow /2009/search/
Disallow /2009/search/?
Disallow /2009/search
Disallow /2010/search/
Disallow /2010/search/?
Disallow /2010/search
Disallow /2011/search/
Disallow /2011/search/?
Disallow /2011/search
Allow /intl/en/
Allow /intl/en/about/
Allow /intl/en/labs/
Allow /intl/en/terms/
Allow /intl/en/privacy/
Allow /intl/en/products/