languagecrawler.com
robots.txt

Robots Exclusion Standard data for languagecrawler.com

Resource Scan

Scan Details

Site Domain languagecrawler.com
Base Domain languagecrawler.com
Scan Status Ok
Last Scan2025-10-12T02:18:34+00:00
Next Scan 2025-10-19T02:18:34+00:00

Last Scan

Scanned2025-10-12T02:18:34+00:00
URL https://www.languagecrawler.com/robots.txt
Domain IPs 2404:6800:4003:c02::79, 74.125.130.121
Response IP 172.217.194.121
Found Yes
Hash 23cf47212299ea76d2c8d729b6fd0ad0f435c5dabe840654feb9dc1248044b02
SimHash 4d54da50df53

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap https://www.languagecrawler.com/sitemap.xml