osapublishing.org
robots.txt

Robots Exclusion Standard data for osapublishing.org

Resource Scan

Scan Details

Site Domain osapublishing.org
Base Domain osapublishing.org
Scan Status Ok
Last Scan2024-09-21T06:54:57+00:00
Next Scan 2024-10-21T06:54:57+00:00

Last Scan

Scanned2024-09-21T06:54:57+00:00
URL http://osapublishing.org/robots.txt
Redirect https://opg.optica.org/robots.txt
Redirect Domain opg.optica.org
Redirect Base optica.org
Domain IPs 38.95.177.60
Redirect IPs 65.202.222.45
Response IP 65.202.222.45
Found Yes
Hash 221d9b705db94fade67fb8edcb577691fafe066219dd49fbd266ae1a03ea89e1
SimHash 54545905e684

Groups

*

Rule Path
Disallow /ViewMedia.cfm
Disallow /viewmedia.cfm
Disallow /user/
Disallow /openathens.cfm
Disallow /getImage.cfm
Disallow /getimage.cfm

googlebot

Rule Path
Disallow /user/referencelogin.cfm
Disallow /user/

bingbot

Rule Path
Disallow /user/referencelogin.cfm
Disallow /user/

ahrefsbot

Rule Path
Disallow /

mozilla/5.0 (compatible; msie 10.0; windows nt 6.1; trident/6.0) linkcheck by siteimprove.com

Rule Path
Disallow /

petalbot
adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot

Rule Path
Disallow /

Warnings

  • `disalow` is not a known field.