crhemd.com
robots.txt

Robots Exclusion Standard data for crhemd.com

Resource Scan

Scan Details

Site Domain crhemd.com
Base Domain crhemd.com
Scan Status Ok
Last Scan2026-04-04T02:48:18+00:00
Next Scan 2026-05-04T02:48:18+00:00

Last Scan

Scanned2026-04-04T02:48:18+00:00
URL https://crhemd.com/robots.txt
Domain IPs 148.113.25.148
Response IP 148.113.25.148
Found Yes
Hash 4cdebd3c160244e40a9bacfa0e82c9756c9186d53c743dc2d23954652bf23a9e
SimHash 800dee32065f

Groups

*

Rule Path
Allow /*.js
Allow /*.css
Allow /*.jpg
Allow /*.png

google-extended

Rule Path
Allow /

gptbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

httrack disallow /
httrack disallow: /
netcaptor disallow /
netcaptor disallow: /
offline explorer disallow /
offline explorer disallow: /
spiderku/0.9 disallow /
spiderku/0.9 disallow: /
steeler disallow /
steeler disallow: /
webcopier v3.3 disallow /
webcopier v3.3 disallow: /
webcopier v3.2a disallow /
webcopier v3.2a disallow: /
webcopier disallow /
webcopier disallow: /
webcrawler disallow: /
web downloader/4.9 disallow /
web downloader/4.9 disallow: /
web downloader/5.8 disallow /
web downloader/5.8 disallow: /
webgather 3.0 disallow /
webgather 3.0 disallow: /
webstripper/2.56 disallow /
webstripper/2.56 disallow: /
webzip/3.65 disallow /
webzip/3.65 disallow: /
webzip disallow /
webzip disallow: /
wget disallow /
wget disallow: /
zao disallow /
zao disallow: /
zeus 2.6 disallow /
zeus 2.6 disallow: /

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://crhemd.com/sitemap.xml

Warnings

  • 3 invalid lines.