procs.lt
robots.txt
Robots Exclusion Standard data for procs.lt
Resource Scan
Scan Details
Site Domain | procs.lt |
Base Domain | procs.lt |
Scan Status | Ok |
Last Scan | 2024-06-27T19:15:22+00:00 |
Next Scan | 2024-07-04T19:15:22+00:00 |
Last Scan
Scanned | 2024-06-27T19:15:22+00:00 |
URL | https://procs.lt/robots.txt |
Domain IPs | 104.21.30.127, 172.67.172.236, 2606:4700:3033::6815:1e7f, 2606:4700:3035::ac43:acec |
Response IP | 172.67.172.236 |
Found | Yes |
Hash | fb601b2a8b7636e5e3a057f0d4d47101646b2e32e8eaa89dbee991fe4afbb995 |
SimHash | 6b38cc68cb92 |
Groups
*
Rule | Path |
---|---|
Disallow | |
Disallow | /?blackhole |
Disallow | /wp-admin/ |
Disallow | /cgi-bin/ |
Other Records
Field | Value |
---|---|
crawl-delay | 120 |
Other Records
Field | Value |
---|---|
sitemap | https://www.procs.lt/sitemap.xml |
sitemap | https://www.procs.lt/sitemap.xml.gz |
Warnings
- `host` is not a known field.