procs.lt
robots.txt

Robots Exclusion Standard data for procs.lt

Resource Scan

Scan Details

Site Domain procs.lt
Base Domain procs.lt
Scan Status Ok
Last Scan2024-06-27T19:15:22+00:00
Next Scan 2024-07-04T19:15:22+00:00

Last Scan

Scanned2024-06-27T19:15:22+00:00
URL https://procs.lt/robots.txt
Domain IPs 104.21.30.127, 172.67.172.236, 2606:4700:3033::6815:1e7f, 2606:4700:3035::ac43:acec
Response IP 172.67.172.236
Found Yes
Hash fb601b2a8b7636e5e3a057f0d4d47101646b2e32e8eaa89dbee991fe4afbb995
SimHash 6b38cc68cb92

Groups

*

Rule Path
Disallow
Disallow /?blackhole
Disallow /wp-admin/
Disallow /cgi-bin/

Other Records

Field Value
crawl-delay 120

Other Records

Field Value
sitemap https://www.procs.lt/sitemap.xml
sitemap https://www.procs.lt/sitemap.xml.gz

Warnings

  • `host` is not a known field.