think-diary.com
robots.txt
Robots Exclusion Standard data for think-diary.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | think-diary.com |
Base Domain | think-diary.com |
Scan Status | Ok |
Last Scan | 2025-08-16T16:45:26+00:00 |
Next Scan | 2025-08-23T16:45:26+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-08-16T16:45:26+00:00 |
URL | https://think-diary.com/robots.txt |
Redirect | https://www.think-diary.com/robots.txt |
Redirect Domain | www.think-diary.com |
Redirect Base | think-diary.com |
Domain IPs | 118.27.122.154 |
Redirect IPs | 118.27.122.154 |
Response IP | 118.27.122.154 |
Found | Yes |
Hash | 7a76ed7375276ff81bb6da8fb82e9abaeb82d2352fea87c896aa26915057c84b |
SimHash | 40001c60cfb2 |
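The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. The following is a minimal sketch of how such a content fingerprint could be computed, assuming SHA-256 over the raw response body; the function name `robots_fingerprint` and the sample body are hypothetical and are not taken from the scanner.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    state which algorithm or normalization it actually uses.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by the site.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = robots_fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing digests from two scans is a cheap way to detect that the file changed at all; a similarity hash such as the SimHash field is the kind of value used to judge how much it changed.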
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Allow | /http%3A//rsssitemap.xml |
Allow | /http%3A//rsslatest.xml |
Allow | /http%3A//htmlsitemap.htm |
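The group above mixes a Disallow for `/wp-admin/` with a more specific Allow for `/wp-admin/admin-ajax.php`. Under RFC 9309, the rule with the longest matching path wins, and an Allow wins a tie. The sketch below illustrates that evaluation for the rules listed in this report. It is a simplified illustration, not a full parser: it assumes plain prefix matching (none of these rules use `*` or `$`) and does no percent-encoding normalization. The names `RULES` and `is_allowed` are hypothetical.

```python
# Rules copied verbatim from the "*" group in the report.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/http%3A//rsssitemap.xml"),
    ("allow", "/http%3A//rsslatest.xml"),
    ("allow", "/http%3A//htmlsitemap.htm"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest-match evaluation; a path with no matching rule is allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        # A longer match replaces the current one; on equal length, Allow wins.
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = kind == "allow"
    return allowed


for p in ["/", "/wp-admin/", "/wp-admin/options.php", "/wp-admin/admin-ajax.php"]:
    print(p, is_allowed(p))
```

Note that Python's standard `urllib.robotparser` applies rules in file order rather than by longest match, so it may give a different answer for `/wp-admin/admin-ajax.php` when the Disallow line comes first.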
Other Records
Field | Value |
---|---|
sitemap | https://www.think-diary.com/wp-sitemap.xml |
sitemap | https://www.think-diary.com/xmlsitemap.xml |
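Sitemap records sit outside any user-agent group, which is why the report lists them separately. As a rough sketch, they can be read back with Python's standard `urllib.robotparser` (the `site_maps()` method requires Python 3.8 or later). The `ROBOTS_TXT` text below is a reconstruction from the fields in this report and is an assumption; the exact bytes served by the site may differ.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the report's fields.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://www.think-diary.com/wp-sitemap.xml
Sitemap: https://www.think-diary.com/xmlsitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of URLs, or None if no Sitemap lines were found.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```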