gazeta19.ru
robots.txt
Robots Exclusion Standard data for gazeta19.ru
Resource Scan
Scan Details
Site Domain | gazeta19.ru |
Base Domain | gazeta19.ru |
Scan Status | Ok |
Last Scan | 2024-09-23T04:17:51+00:00 |
Next Scan | 2024-09-30T04:17:51+00:00 |
Last Scan
Scanned | 2024-09-23T04:17:51+00:00 |
URL | https://gazeta19.ru/robots.txt |
Domain IPs | 195.211.251.41 |
Response IP | 195.211.251.41 |
Found | Yes |
Hash | 101148fe9a755c55e818e25b10d1080cb6bd189abd87fde159c19644938b7964 |
SimHash | 631d155943fc |
Groups
*
Rule | Path |
---|---|
Disallow | /administrator/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /cgi-bin/ |
Disallow | /cli/ |
Disallow | /components/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /images/raxo_thumbs/ |
Disallow | /language/ |
Disallow | /layouts/ |
Disallow | /libraries/ |
Disallow | /logs/ |
Disallow | /log/ |
Disallow | /modules/ |
Disallow | /plugins/ |
Disallow | /some/ |
Disallow | /templates/ |
Disallow | /tmp/ |
Other Records
Field | Value |
---|---|
crawl-delay | 0.1 |
Warnings
- `host` is not a known field.
Comments