oceanlibrary.com
robots.txt

Robots Exclusion Standard data for oceanlibrary.com

Resource Scan

Scan Details

Site Domain oceanlibrary.com
Base Domain oceanlibrary.com
Scan Status Ok
Last Scan2024-09-17T05:28:03+00:00
Next Scan 2024-10-17T05:28:03+00:00

Last Scan

Scanned2024-09-17T05:28:03+00:00
URL https://oceanlibrary.com/robots.txt
Domain IPs 13.33.30.102, 13.33.30.29, 13.33.30.42, 13.33.30.96
Response IP 13.33.30.102
Found Yes
Hash 94b62ce060b908bbeb7109a372893306d3447795b337b5ced2e7325c60381f9a
SimHash eb5551187b10

Groups

twitterbot/1.0

Rule Path
Disallow

twitterbot/0.1

Rule Path
Disallow

telegrambot (like twitterbot)

Rule Path
Disallow

twitterbot

Rule Path
Disallow

*

Rule Path
Disallow /app

*

Rule Path
Disallow /link

*

Rule Path
Disallow /saved_for_offline

*

Rule Path
Disallow /mac

*

Rule Path
Disallow /win

*

Rule Path
Disallow /linux

*

Rule Path
Disallow /electron

*

Rule Path
Disallow /ios

*

Rule Path
Disallow /android

*

Rule Path
Disallow /editor

Other Records

Field Value
sitemap https://oceanlibrary.com/server//sitemap/sitemap.xml
sitemap https://oceanlibrary.com/server//sitemap/sitemapimages.xml