en.cppreference.com
robots.txt

Robots Exclusion Standard data for en.cppreference.com

Resource Scan

Scan Details

Site Domain en.cppreference.com
Base Domain cppreference.com
Scan Status Ok
Last Scan 2024-05-23T02:26:24+00:00
Next Scan 2024-06-22T02:26:24+00:00

Last Scan

Scanned 2024-05-23T02:26:24+00:00
URL https://en.cppreference.com/robots.txt
Domain IPs 2604:4f00::3:0:1238:1, 74.114.90.20
Response IP 74.114.90.20
Found Yes
Hash 5b9cc61078712c034a9bc424f6cb255487c098a30dca31354caee002244b6705
SimHash f502d110403a

Groups

*

Rule Path
Allow /
Disallow /wiki_old/
Disallow /mwiki/
Disallow /tmpw/
Disallow /w/Special%3A
Disallow /w/Template%3A
Disallow /w/Mediawiki%3A
Disallow /w/Talk%3A
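
The rules for the * group can be checked against a path with a small matcher. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical, the rule list is copied from the table above rather than from the live file, and the sketch does plain prefix comparison with no wildcard handling or percent-encoding normalization.

```python
# Rules for the "*" group, copied from the table above (not fetched live).
RULES = [
    ("allow", "/"),
    ("disallow", "/wiki_old/"),
    ("disallow", "/mwiki/"),
    ("disallow", "/tmpw/"),
    ("disallow", "/w/Special%3A"),
    ("disallow", "/w/Template%3A"),
    ("disallow", "/w/Mediawiki%3A"),
    ("disallow", "/w/Talk%3A"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        # A longer prefix is more specific; on equal length, allow wins.
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/w/cpp/container/vector", "/mwiki/index.php", "/w/Talk%3Acpp"):
        print(p, is_allowed(p))
```

Longest-match is used here instead of first-match because the table lists Allow / before the Disallow entries; a first-match evaluator would stop at Allow / and never reach the more specific Disallow rules.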

awariorssbot
awariosmartbot
awariobot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
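
A crawler that chooses to honor this record could space out its requests as in the sketch below. It assumes the value is a number of seconds, which is the common reading but is not defined by RFC 9309, and crawlers differ in whether and how they apply it. The class name CrawlDelay and its parameters are hypothetical; the clock and sleep functions are injectable only so the behavior can be checked without real waiting.

```python
import time


class CrawlDelay:
    """Spaces out requests by a fixed number of seconds (assumed unit)."""

    def __init__(self, delay_seconds=10.0, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Block until `delay` seconds have passed since the last call.

        Returns the number of seconds actually slept.
        """
        now = self._clock()
        pause = 0.0
        if self._last is not None:
            pause = max(0.0, self.delay - (now - self._last))
            if pause > 0:
                self._sleep(pause)
        # Record when this request is released, including any pause.
        self._last = now + pause
        return pause
```

Calling wait() before each request to the site keeps consecutive requests at least 10 seconds apart when constructed with the default delay.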