notesbythalia.com
robots.txt

Robots Exclusion Standard data for notesbythalia.com

Resource Scan

Scan Details

Site Domain notesbythalia.com
Base Domain notesbythalia.com
Scan Status Ok
Last Scan2025-08-25T01:55:12+00:00
Next Scan 2025-09-24T01:55:12+00:00

Last Scan

Scanned2025-08-25T01:55:12+00:00
URL https://notesbythalia.com/robots.txt
Domain IPs 104.21.94.244, 172.67.141.219, 2606:4700:3030::6815:5ef4, 2606:4700:3037::ac43:8ddb
Response IP 172.67.141.219
Found Yes
Hash 3a4a9e4fb9c6987e956e9004ca56153331b31c989e34b657de615fee8593806b
SimHash 69189102e696

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*add-to-cart%3D*
Disallow /lib
Disallow /core
Disallow /cdn-cgi
Disallow /wp-json
Disallow /storage

googlebot

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/

ccbot

Rule Path
Disallow

gptbot

Rule Path
Disallow

google-extended

Rule Path
Disallow /

claudebot

Rule Path
Disallow

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

missinglettrbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://notesbythalia.com/sitemaps.xml

Comments

  • Link to your sitemap
  • Block Search Results
  • Block ChatGPT bot
  • Block Bard bot
  • Block Claude bot
  • Block Petal bot
  • Block SemrushBot
  • Block MajesticSEOBot
  • Block Missinglettr