archlinux.site
robots.txt

Robots Exclusion Standard data for archlinux.site

Resource Scan

Scan Details

Site Domain archlinux.site
Base Domain archlinux.site
Scan Status Ok
Last Scan2024-10-09T00:41:53+00:00
Next Scan 2024-10-16T00:41:53+00:00

Last Scan

Scanned2024-10-09T00:41:53+00:00
URL https://archlinux.site/robots.txt
Redirect https://www.archlinux.site/robots.txt
Redirect Domain www.archlinux.site
Redirect Base archlinux.site
Domain IPs 2001:4860:4802:32::15, 2001:4860:4802:34::15, 2001:4860:4802:36::15, 2001:4860:4802:38::15, 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 172.217.194.121, 2404:6800:4003:c03::79
Response IP 74.125.68.121
Found Yes
Hash 939b84f9a22ed1672db955e488a45842c8729aca2ce0f56c528be020075e62d8
SimHash 4904de50c612

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap https://www.archlinux.site/sitemap.xml