turtaloha.com
robots.txt
Robots Exclusion Standard data for turtaloha.com
Resource Scan
Scan Details
Site Domain | turtaloha.com |
Base Domain | turtaloha.com |
Scan Status | Ok |
Last Scan | 2024-11-08T03:11:05+00:00 |
Next Scan | 2024-11-15T03:11:05+00:00 |
Last Scan
Scanned | 2024-11-08T03:11:05+00:00 |
URL | http://turtaloha.com/robots.txt |
Redirect | http://www.turtaloha.com/robots.txt |
Redirect Domain | www.turtaloha.com |
Redirect Base | turtaloha.com |
Domain IPs | 195.154.21.79 |
Redirect IPs | 195.154.21.65 |
Response IP | 195.154.21.65 |
Found | Yes |
Hash | 6352bf94d25933dd0f016495d91feb17c4ef60901b918f0ab3ff1368614284f3 |
SimHash | ab5edc0266b0 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /storage/do_xml/id/ |
Other Records
Field | Value |
---|---|
sitemap | http://www.turtaloha.com/sitemap.xml |