think-diary.com
robots.txt

Robots Exclusion Standard data for think-diary.com

Resource Scan

Scan Details

Site Domain think-diary.com
Base Domain think-diary.com
Scan Status Ok
Last Scan 2025-08-16T16:45:26+00:00
Next Scan 2025-08-23T16:45:26+00:00

Last Scan

Scanned 2025-08-16T16:45:26+00:00
URL https://think-diary.com/robots.txt
Redirect https://www.think-diary.com/robots.txt
Redirect Domain www.think-diary.com
Redirect Base think-diary.com
Domain IPs 118.27.122.154
Redirect IPs 118.27.122.154
Response IP 118.27.122.154
Found Yes
Hash 7a76ed7375276ff81bb6da8fb82e9abaeb82d2352fea87c896aa26915057c84b
SimHash 40001c60cfb2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /http%3A//rsssitemap.xml
Allow /http%3A//rsslatest.xml
Allow /http%3A//htmlsitemap.htm
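
The rules in the `*` group overlap: `/wp-admin/` is disallowed, while the more specific `/wp-admin/admin-ajax.php` is allowed. A minimal sketch of how a crawler might resolve that, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES` and `is_allowed` are hypothetical and not part of the scan. The sketch handles plain path prefixes only and ignores the `*` and `$` wildcards, which none of the rules above use.

```python
# Rules copied from the "*" group listed in the scan above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/http%3A//rsssitemap.xml"),
    ("allow", "/http%3A//rsslatest.xml"),
    ("allow", "/http%3A//htmlsitemap.htm"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: plain prefix matching only; wildcards are not handled.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_goes_to_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_goes_to_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for candidate in ("/", "/wp-admin/", "/wp-admin/admin-ajax.php"):
        print(candidate, is_allowed(candidate))
```

Parsers that apply the first matching rule instead of the longest one would treat `/wp-admin/admin-ajax.php` as disallowed here, because the Disallow line is listed first.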

Other Records

Field Value
sitemap https://www.think-diary.com/wp-sitemap.xml
sitemap https://www.think-diary.com/xmlsitemap.xml
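
The sitemap records above sit outside any user-agent group and apply to the whole file. A minimal sketch of how they could be pulled from the raw text follows. The scan does not reproduce the file itself, so `ROBOTS_TXT` is an assumed reconstruction from the listing above (line order and field casing are guesses), and `sitemap_urls` is a hypothetical helper.

```python
# Assumed reconstruction of the file from the scan listing; the real
# robots.txt may differ in ordering, casing, comments, and blank lines.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://www.think-diary.com/wp-sitemap.xml
Sitemap: https://www.think-diary.com/xmlsitemap.xml
"""


def sitemap_urls(robots_text):
    """Collect the values of all Sitemap records, in file order."""
    urls = []
    for line in robots_text.splitlines():
        # Drop trailing comments; this simplification would also cut a
        # URL at a "#" fragment, which sitemap URLs normally do not have.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the "https:" in the value survives.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


if __name__ == "__main__":
    for url in sitemap_urls(ROBOTS_TXT):
        print(url)
```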