cafemedia.com
robots.txt
Robots Exclusion Standard data for cafemedia.com
Resource Scan
Scan Details
Site Domain | cafemedia.com |
Base Domain | cafemedia.com |
Scan Status | Ok |
Last Scan | 2024-11-14T04:00:22+00:00 |
Next Scan | 2024-11-21T04:00:22+00:00 |
Last Scan
Scanned | 2024-11-14T04:00:22+00:00 |
URL | https://cafemedia.com/robots.txt |
Domain IPs | 141.193.213.10, 141.193.213.11 |
Response IP | 141.193.213.10 |
Found | Yes |
Hash | cd4e8480840a8baba53360b9529f93f2c63deb205173390b1a8cf76ef4c1a49f |
SimHash | 0518ccd067f5 |
Groups
*
Rule | Path |
---|---|
Allow | /*?v |
Disallow | /calendar/action* |
Disallow | /events/action* |
Disallow | /*? |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |
Comments