gjmedph.org
robots.txt

Robots Exclusion Standard data for gjmedph.org

Resource Scan

Scan Details

Site Domain gjmedph.org
Base Domain gjmedph.org
Scan Status Ok
Last Scan2025-11-12T04:06:21+00:00
Next Scan 2025-12-12T04:06:21+00:00

Last Scan

Scanned2025-11-12T04:06:21+00:00
URL https://gjmedph.org/robots.txt
Domain IPs 104.21.59.77, 172.67.218.164, 2606:4700:3030::ac43:daa4, 2606:4700:3033::6815:3b4d
Response IP 104.21.59.77
Found Yes
Hash a1d7088db63ff73d8d122c0d3743bd9c5fc8fc5acacb7213481286a739b53652
SimHash 4f101b72a733

Groups

*

Rule Path
Disallow /?
Disallow /*?
Disallow /*?page=
Disallow /cgi-bin*
Disallow /functions/sitemap-generation.php
Allow /*.css
Allow /*.js

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gjmedph.org/sitemap.xml

Warnings

  • `host` is not a known field.