osapublishing.org
robots.txt
Robots Exclusion Standard data for osapublishing.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | osapublishing.org |
Base Domain | osapublishing.org |
Scan Status | Ok |
Last Scan | 2024-10-21T06:55:14+00:00 |
Next Scan | 2024-11-20T06:55:14+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-21T06:55:14+00:00 |
URL | http://osapublishing.org/robots.txt |
Redirect | https://opg.optica.org/robots.txt |
Redirect Domain | opg.optica.org |
Redirect Base | optica.org |
Domain IPs | 38.95.177.60 |
Redirect IPs | 65.202.222.45 |
Response IP | 65.202.222.45 |
Found | Yes |
Hash | f55cca85dd536ebf0297dd7634c719f6cf42e1e0ea6693b13da99cd4da16cc48 |
SimHash | 54545992e284 |
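The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming it is SHA-256 over the raw response body, a fingerprint of this kind could be computed roughly as follows. The function name `content_fingerprint` and the sample body are hypothetical, and the output will not match the Hash shown here because the actual file bytes are not reproduced in this report.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt response body.

    Assumption: the scanner hashes the unmodified bytes; the report does
    not confirm the algorithm or any normalization step.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the file served by opg.optica.org.
sample = b"User-agent: *\nDisallow: /user/\n"
print(content_fingerprint(sample))
```

Comparing the digest from one scan to the next is a cheap way to tell whether the file changed at all; it says nothing about how much it changed, which is presumably what the separate SimHash field is for.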
Groups
*
Rule | Path |
---|---|
Disallow | /ViewMedia.cfm |
Disallow | /viewmedia.cfm |
Disallow | /user/ |
Disallow | /openathens.cfm |
Disallow | /getImage.cfm |
Disallow | /getimage.cfm |
mozilla/5.0 (compatible; msie 10.0; windows nt 6.1; trident/6.0) linkcheck by siteimprove.com
Rule | Path |
---|---|
Disallow | / |
petalbot
adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot
Rule | Path |
---|---|
Disallow | / |
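The groups above can be checked against concrete URLs. The following is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the tables in this report, with only two of the fully blocked agents included for brevity, so it approximates the file and is not the file as served. `ExampleBot`, `is_allowed`, and the path `/abstract.cfm` are hypothetical names used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups tables; NOT the literal file contents.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ViewMedia.cfm
Disallow: /viewmedia.cfm
Disallow: /user/
Disallow: /openathens.cfm
Disallow: /getImage.cfm
Disallow: /getimage.cfm

User-agent: gptbot
User-agent: ccbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://opg.optica.org" + path)


# An unlisted crawler falls under the `*` group.
print(is_allowed("ExampleBot", "/user/profile"))   # blocked by /user/
print(is_allowed("ExampleBot", "/abstract.cfm"))   # no matching rule
# A crawler named in the second group is blocked everywhere.
print(is_allowed("GPTBot", "/abstract.cfm"))
```

`urllib.robotparser` matches rule paths as case-sensitive prefixes, which is one plausible reason the file lists both `/ViewMedia.cfm` and `/viewmedia.cfm`. Other crawlers may implement matching differently.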
Warnings
- `disalow` is not a known field (likely a misspelling of `Disallow`).
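A warning like this one can be reproduced with a small field-name check. The sketch below is an illustration under stated assumptions, not the scanner's actual logic: the set of known fields (the three from RFC 9309 plus the common `Sitemap` and `Crawl-delay` extensions) and the function name `unknown_fields` are choices made here.

```python
# Assumed list of accepted field names; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(text: str) -> list[tuple[int, str]]:
    """Return (line number, field name) pairs for unrecognized fields."""
    found = []
    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            found.append((number, field))
    return found


# Hypothetical input containing the same misspelling.
sample = "User-agent: *\nDisalow: /private/\nDisallow: /user/\n"
print(unknown_fields(sample))
```

Most parsers ignore a line with an unknown field, so a misspelled `Disallow` typically means the intended path is not actually blocked.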