purakara.com
robots.txt
Robots Exclusion Standard data for purakara.com
Resource Scan
Scan Details
Site Domain | purakara.com |
Base Domain | purakara.com |
Scan Status | Ok |
Last Scan | 2024-09-25T07:04:16+00:00 |
Next Scan | 2024-09-26T07:04:16+00:00 |
Last Scan
Scanned | 2024-09-25T07:04:16+00:00 |
URL | https://purakara.com/robots.txt |
Domain IPs | 192.0.78.131, 192.0.78.231 |
Response IP | 192.0.78.131 |
Found | Yes |
Hash | 9b52ecd07b8ba53d7466982d6c3a8e960f54e26c9905849eeb2f3811a28d793c |
SimHash | 49800ca42bb2 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Allow | /http%3A//rsssitemap.xml |
Allow | /http%3A//rsslatest.xml |
Allow | /http%3A//htmlsitemap.htm |
Other Records
Field | Value |
---|---|
sitemap | https://purakara.com/sitemap.xml |
sitemap | https://purakara.com/news-sitemap.xml |
sitemap | https://purakara.com/xmlsitemap.xml |