sciencebasedmedicine.org
robots.txt

Robots Exclusion Standard data for sciencebasedmedicine.org

Resource Scan

Scan Details

Site Domain sciencebasedmedicine.org
Base Domain sciencebasedmedicine.org
Scan Status Ok
Last Scan 2024-10-31T01:10:31+00:00
Next Scan 2024-11-30T01:10:31+00:00

Last Scan

Scanned 2024-10-31T01:10:31+00:00
URL https://sciencebasedmedicine.org/robots.txt
Domain IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.21
Found Yes
Hash cc8ecabe0f614fad86bf9223ce0bc90bf1ee7ad427a94174ead7d85346f25168
SimHash 259100e44cbd
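
The 64-hex-digit Hash value has the length of a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption (SHA-256 over the raw response body), a locally fetched copy of robots.txt could be compared against the scanned version like this. The function names `body_digest` and `matches_report` are hypothetical helpers, not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "cc8ecabe0f614fad86bf9223ce0bc90bf1ee7ad427a94174ead7d85346f25168"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a raw robots.txt body.

    Assumption: the report hashes the unmodified response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Check whether a fetched body has the same digest as the reported one."""
    return body_digest(body) == reported.lower()
```

If the digests differ, either the file changed since the scan or the assumption about the algorithm or hashed bytes is wrong; the sketch cannot tell these cases apart.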

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /?s=
Disallow *%26s%3D
Disallow /search
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

Other Records

Field Value
crawl-delay 30
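
Several rules in the `*` group overlap (for example `Disallow /wp-` and `Allow /wp-content/uploads/`), so the outcome for a given path depends on how a crawler resolves conflicts. The sketch below assumes the longest-match convention described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie. The helper names are hypothetical, pattern length is used as the measure of specificity, and percent-encoding normalization is ignored, so individual crawlers may behave differently.

```python
import re

# Rules of the "*" group, copied from the table above.
RULES = [
    ("disallow", "/cgi-bin"), ("disallow", "/wp-"), ("disallow", "/?s="),
    ("disallow", "*%26s%3D"), ("disallow", "/search"),
    ("disallow", "*?attachment_id="), ("disallow", "*/feed"),
    ("disallow", "*/rss"), ("disallow", "*/embed"),
    ("allow", "/wp-content/uploads/"), ("allow", "/wp-content/themes/"),
    ("allow", "/*/*.js"), ("allow", "/*/*.css"),
    ("allow", "/wp-*.png"), ("allow", "/wp-*.jpg"), ("allow", "/wp-*.jpeg"),
    ("allow", "/wp-*.gif"), ("allow", "/wp-*.svg"), ("allow", "/wp-*.pdf"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path: str, rules=RULES) -> bool:
    """Return True if url_path may be crawled under longest-match rules."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if pattern_to_regex(pattern).match(url_path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/wp-admin/"))                             # False
print(is_allowed("/wp-content/uploads/2024/10/chart.png"))  # True
print(is_allowed("/wp-includes/js/jquery/jquery.js"))       # True
print(is_allowed("/category/vaccines/feed"))                # False
print(is_allowed("/about/"))                                # True
```

The example paths are made up for illustration. Under this convention the broad `Disallow /wp-` is overridden wherever a longer Allow pattern also matches, which is why uploaded images and script files stay crawlable.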

yahoo-slurp

Rule Path
Disallow (empty)

Other Records

Field Value
crawl-delay 20

bingbot

Rule Path
Disallow (empty)

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap http://sciencebasedmedicine.org/sitemap.xml
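
To show how the per-agent records above are typically consumed, the sketch below feeds a trimmed reconstruction of the file to Python's standard `urllib.robotparser`. The reconstruction is an assumption: the report lists parsed fields, not the original file, so ordering and spelling may differ, and only one rule of the `*` group is included. Note that `urllib.robotparser` uses simple prefix matching and does not interpret `*` wildcards, so it is used here only for the crawl-delay, empty Disallow, and sitemap records. `ExampleBot` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Trimmed reconstruction of the scanned file (an assumption, see above).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin
Crawl-delay: 30

User-agent: yahoo-slurp
Disallow:
Crawl-delay: 20

User-agent: bingbot
Disallow:
Crawl-delay: 20

Sitemap: http://sciencebasedmedicine.org/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Named groups get their own delay; everyone else falls back to "*".
print(parser.crawl_delay("bingbot"))      # 20
print(parser.crawl_delay("yahoo-slurp"))  # 20
print(parser.crawl_delay("ExampleBot"))   # 30

# An empty Disallow line disallows nothing for that agent.
print(parser.can_fetch("bingbot", "/cgi-bin/test"))     # True
print(parser.can_fetch("ExampleBot", "/cgi-bin/test"))  # False

# Sitemap records are global rather than tied to a group (Python 3.8+).
print(parser.site_maps())  # ['http://sciencebasedmedicine.org/sitemap.xml']
```

Because a crawler follows only the group that matches its user agent, the `bingbot` and `yahoo-slurp` groups replace the `*` rules for those agents instead of adding to them.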