osapublishing.org
robots.txt

Robots Exclusion Standard data for osapublishing.org

Resource Scan

Scan Details

Site Domain osapublishing.org
Base Domain osapublishing.org
Scan Status Ok
Last Scan 2024-10-21T06:55:14+00:00
Next Scan 2024-11-20T06:55:14+00:00

Last Scan

Scanned 2024-10-21T06:55:14+00:00
URL http://osapublishing.org/robots.txt
Redirect https://opg.optica.org/robots.txt
Redirect Domain opg.optica.org
Redirect Base optica.org
Domain IPs 38.95.177.60
Redirect IPs 65.202.222.45
Response IP 65.202.222.45
Found Yes
Hash f55cca85dd536ebf0297dd7634c719f6cf42e1e0ea6693b13da99cd4da16cc48
SimHash 54545992e284

Groups

*

Rule Path
Disallow /ViewMedia.cfm
Disallow /viewmedia.cfm
Disallow /user/
Disallow /openathens.cfm
Disallow /getImage.cfm
Disallow /getimage.cfm

googlebot

Rule Path
Disallow /user/referencelogin.cfm
Disallow /user/

bingbot

Rule Path
Disallow /user/referencelogin.cfm
Disallow /user/

ahrefsbot

Rule Path
Disallow /

mozilla/5.0 (compatible; msie 10.0; windows nt 6.1; trident/6.0) linkcheck by siteimprove.com

Rule Path
Disallow /

petalbot
adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot

Rule Path
Disallow /
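To see how the groups above combine, the following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the groups in this report and covers only a subset of the listed agents, so it is an assumption and may differ from the file actually served at the redirect target. The agent name `examplebot`, the path `/abstract.cfm`, and the helpers `build_parser` and `is_allowed` are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report (subset of agents only);
# the live file may be ordered or spelled differently.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ViewMedia.cfm
Disallow: /viewmedia.cfm
Disallow: /user/
Disallow: /openathens.cfm
Disallow: /getImage.cfm
Disallow: /getimage.cfm

User-agent: googlebot
Disallow: /user/referencelogin.cfm
Disallow: /user/

User-agent: ahrefsbot
Disallow: /

User-agent: gptbot
User-agent: claudebot
Disallow: /
"""


def build_parser(text):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, agent, path):
    """Return True if `agent` may fetch `path` under the parsed rules."""
    return parser.can_fetch(agent, path)


parser = build_parser(ROBOTS_TXT)

# "examplebot" and "/abstract.cfm" are hypothetical names for illustration.
for agent, path in [
    ("examplebot", "/ViewMedia.cfm"),
    ("examplebot", "/abstract.cfm"),
    ("googlebot", "/ViewMedia.cfm"),
    ("gptbot", "/"),
]:
    print(agent, path, is_allowed(parser, agent, path))
```

Two behaviours are worth noting in this sketch. An agent with its own group (such as `googlebot`) is evaluated against that group only, so the `*` rules do not apply to it. Path rules are matched as case-sensitive prefixes, which is presumably why the `*` group lists both `/ViewMedia.cfm` and `/viewmedia.cfm`.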

Comments

  • Algolia-Crawler-Verif: FDE4C78B0290C718

Warnings

  • `disalow` is not a known field.
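A warning like the one above can be reproduced with a simple field-name check. The sketch below is an assumption about how such a check might work, not the scanner's actual implementation: the `KNOWN_FIELDS` set is a guess at a reasonable list, `find_unknown_fields` is a hypothetical helper, and the `SAMPLE` text is invented for illustration because the report does not show the line that contained `disalow`.

```python
# Assumed list of recognised fields; the scanner's real list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def find_unknown_fields(text):
    """Return (line_number, field) pairs for unrecognised field names."""
    warnings = []
    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop trailing comments, then skip blank or malformed lines.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            warnings.append((number, field))
    return warnings


# Hypothetical sample; the path "/example/" is not taken from the report.
SAMPLE = """\
User-agent: *
Disalow: /example/
Disallow: /user/
"""

for number, field in find_unknown_fields(SAMPLE):
    print(f"line {number}: `{field}` is not a known field.")
```

Parsers generally skip lines with unrecognised fields, so a rule misspelled this way has no effect on crawlers until the field name is corrected to `Disallow`.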