rsc.org
robots.txt

Robots Exclusion Standard data for rsc.org

Resource Scan

Scan Details

Site Domain rsc.org
Base Domain rsc.org
Scan Status Ok
Last Scan2024-11-13T10:01:13+00:00
Next Scan 2024-11-20T10:01:13+00:00

Last Scan

Scanned2024-11-13T10:01:13+00:00
URL https://rsc.org/robots.txt
Redirect https://www.rsc.org/robots.txt
Redirect Domain www.rsc.org
Redirect Base rsc.org
Domain IPs 78.25.196.148
Redirect IPs 78.25.196.148
Response IP 78.25.196.148
Found Yes
Hash 87d5bfbc2ad8b13bd4a5b77f84d85db352c6db9ac0d1cde67f72fc1c48515422
SimHash 308cc2c04d03

Groups

gsa-crawler

Rule Path
Disallow

guidebot

Rule Path
Disallow /

*

Rule Path
Disallow /Membership/Memberzone/
Disallow /is/
Disallow /publishing/journals/rssfeed.asp
Disallow /pdf/members/newsletters/womenchemists_apr02.pdf
Disallow /AboutUs%5CNews/PressReleases/2009/UVToothBleaching.asp
Disallow /Publishing/ChemScience/Volume/2009/03/Turning_the_light_off_on_tooth_bleaching.asp
Disallow /Publishing/EdSymp/
Disallow /Publishing/ATHENS_index.asp
Disallow /images/EdSymp2010_reportforweb_tcm18-185058.pdf
Disallow /Labs/
Disallow /placesofchemistry/
Disallow /rsc-id/
Disallow /images/WorkshopA_Impact_tcm18-177392.ppt
Disallow /images/WorkshopB_EditorialBoards_tcm18-177393.ppt
Disallow /images/_WorkshopC_peer%20review%20workshop_tcm18-179104.ppt
Disallow /images/WorkshopD_ChemSpider_tcm18-177395.ppt
Disallow /images/WorkshopE_eplatform_tcm18-177396.ppt
Disallow /images/WorkshopF_Intl%20Dev_tcm18-177397.ppt
Disallow /images/WorkshopG_SUNAM_RSCEditors_tcm18-177398.ppt
Disallow /images/WorkshopH_SocialMedia_tcm18-177399.ppt
Disallow /images/WorkshopI_OpenAccess_tcm18-177400.ppt
Disallow /images/WorkshopJ_CelinaRamjoue_tcm18-177401.ppt

slurp

Rule Path
Disallow /membership/benefits/join_acs.asp
Disallow /chemistryworld/subscribe_acs.asp
Disallow /Education/SchoolStudents/Olympiad/paper2011.asp
Disallow /Membership/e-Membership/app-help.asp
Disallow /suppdata/
Disallow /chemistryworld/_denial.asp
Disallow /education/eic/_denial.asp
Disallow /publishing/_denial.asp
Disallow /publishing/currentawareness/_denial.asp
Disallow /publishing/journals/_denial.asp
Disallow /membership/_denial.asp

Other Records

Field Value
crawl-delay 30

Comments

  • ACAP version=1.0
  • allow contracted search
  • block GuideBot
  • block robots
  • Editors symposium files
  • allow contracted search
  • User-agent: gsa-crawler
  • block GuideBot
  • User-agent: Guidebot
  • Disallow: /
  • block robots
  • User-agent: *
  • Disallow: /Membership/Memberzone/
  • Disallow: /is/
  • Disallow: /publishing/journals/rssfeed.asp
  • Yahoo crawl
  • Conference Pages
  • Exam File for Robert Bowles
  • e-Membership
  • HD75942 11:11 08/02/2012
  • Denial URLs

Warnings

  • `acap-crawler` is not a known field.
  • `acap-disallow-crawl` is not a known field.