nshm.com
robots.txt

Robots Exclusion Standard data for nshm.com

Resource Scan

Scan Details

Site Domain nshm.com
Base Domain nshm.com
Scan Status Ok
Last Scan2025-10-12T19:18:13+00:00
Next Scan 2025-11-11T19:18:13+00:00

Last Scan

Scanned2025-10-12T19:18:13+00:00
URL https://nshm.com/robots.txt
Domain IPs 104.21.45.190, 172.67.218.70, 2606:4700:3030::6815:2dbe, 2606:4700:3034::ac43:da46
Response IP 104.21.45.190
Found Yes
Hash a932712a6176186f502fcc628b1460dd9f247c489a826f8c47d1e9f66c11e847
SimHash 48189800eda1

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /old-site-backup/
Allow /wp-admin/admin-ajax.php

chatgpt-user

Rule Path
Allow /

chatgpt-plugins

Rule Path
Allow /

claudebot

Rule Path
Allow /

google-extended

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

anthropic-llm

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.nshm.com/sitemap.xml

Comments

  • LLM Bots - Allow access