rarediseases.org
robots.txt

Robots Exclusion Standard data for rarediseases.org

Resource Scan

Scan Details

Site Domain rarediseases.org
Base Domain rarediseases.org
Scan Status Ok
Last Scan 2024-06-11T05:06:47+00:00
Next Scan 2024-07-11T05:06:47+00:00

Last Scan

Scanned 2024-06-11T05:06:47+00:00
URL https://rarediseases.org/robots.txt
Domain IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.20
Found Yes
Hash 2ac728cd2105348283fc8037326e6150f0f31ac6f1d8a454f10b9c1055bf77df
SimHash e27a980cc793
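
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not say how it was computed, so the sketch below rests on an assumption: that the value is the SHA-256 of the raw robots.txt response body. Under that assumption, a freshly fetched copy could be compared against the scan as follows. The helper name `matches_scan_hash` is hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
SCAN_HASH = "2ac728cd2105348283fc8037326e6150f0f31ac6f1d8a454f10b9c1055bf77df"


def matches_scan_hash(body: bytes, expected_hex: str = SCAN_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `expected_hex`.

    Assumption (not confirmed by the report): the Hash field is SHA-256
    computed over the raw robots.txt bytes as served.
    """
    return hashlib.sha256(body).hexdigest() == expected_hex.lower()
```

If the check fails for an unchanged file, the assumption about the hashing scheme (algorithm, or normalization of the body before hashing) is the first thing to revisit.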

Groups

*

Rule Path
Disallow
Disallow /cgi-bin/
Disallow /events/tag/
Disallow /events/category/
Disallow /events/list/
Disallow /search/
Disallow /?
Disallow /page/
Disallow /es/?
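
The Disallow rows above can be read as path prefixes. Below is a minimal sketch of how a crawler might evaluate them, assuming plain prefix matching against the path plus query string. This is a simplification rather than a full robots.txt parser: it skips percent-encoding normalization and has no wildcard or Allow handling, since this group uses neither. The function name `is_allowed` and the sample paths are illustrative only.

```python
# Disallow paths for the "*" group, copied from the table above.
DISALLOW = [
    "",  # empty Disallow value: restricts nothing
    "/cgi-bin/",
    "/events/tag/",
    "/events/category/",
    "/events/list/",
    "/search/",
    "/?",
    "/page/",
    "/es/?",
]


def is_allowed(path_and_query: str) -> bool:
    """Return False if any non-empty Disallow prefix matches the request target."""
    return not any(
        rule and path_and_query.startswith(rule) for rule in DISALLOW
    )


# Hypothetical request targets, chosen to exercise the rules.
for candidate in ["/", "/?s=ataxia", "/page/2/", "/es/", "/es/?s=ataxia", "/events/"]:
    print(candidate, "allowed" if is_allowed(candidate) else "disallowed")
```

Under this reading, `/`, `/es/` and `/events/` remain crawlable, while any home-page URL carrying a query string (`/?...` or `/es/?...`) and paginated listings under `/page/` are excluded.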

Other Records

Field Value
crawl-delay 600
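
The crawl-delay field is a non-standard extension that is commonly interpreted as the minimum number of seconds between successive requests. Not every crawler honors it. Assuming that interpretation, the sketch below shows what a value of 600 implies for a polite client. The function names are hypothetical.

```python
CRAWL_DELAY_SECONDS = 600  # crawl-delay value from the record above
SECONDS_PER_DAY = 24 * 60 * 60


def max_fetches_per_day(delay_seconds: int) -> int:
    """Upper bound on requests per day when waiting delay_seconds between them."""
    return SECONDS_PER_DAY // delay_seconds


def seconds_until_next_fetch(
    last_fetch: float, now: float, delay_seconds: int = CRAWL_DELAY_SECONDS
) -> float:
    """Remaining wait, in seconds, before the next request is permitted."""
    return max(0.0, last_fetch + delay_seconds - now)


print(max_fetches_per_day(CRAWL_DELAY_SECONDS))  # 144
```

At one request every ten minutes, a compliant crawler is limited to 144 fetches per day.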

Other Records

Field Value
sitemap https://rarediseases.org/sitemap_index.xml