caraharian.com
robots.txt
Robots Exclusion Standard data for caraharian.com
Resource Scan
Scan Details
Site Domain | caraharian.com |
Base Domain | caraharian.com |
Scan Status | Ok |
Last Scan | 2024-05-19T20:56:02+00:00 |
Next Scan | 2024-05-26T20:56:02+00:00 |
Last Scan
Scanned | 2024-05-19T20:56:02+00:00 |
URL | https://caraharian.com/robots.txt |
Domain IPs | 104.21.28.62, 172.67.170.103, 2606:4700:3032::6815:1c3e, 2606:4700:3033::ac43:aa67 |
Response IP | 172.67.170.103 |
Found | Yes |
Hash | fbb941028021751ab9595966b613471775f4b2e0f258daf874f02ba9aadb0d2c |
SimHash | 4d644880e99b |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wpo-plugins-tables-list.json |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://caraharian.com/sitemap_index.xml |
Comments