
Robots Exclusion Standard data for avsafetydata.com

Resource Scan

Scan Details

Site Domain avsafetydata.com
Base Domain avsafetydata.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-27T08:41:18+00:00
Next Scan 2026-01-25T08:41:18+00:00

Last Successful Scan

Scanned 2024-12-09T04:04:29+00:00
URL https://avsafetydata.com/robots.txt
Redirect https://asn.flightsafety.org/robots.txt
Redirect Domain asn.flightsafety.org
Redirect Base flightsafety.org
Domain IPs 212.7.211.34
Redirect IPs 13.33.28.119, 13.33.28.25, 13.33.28.30, 13.33.28.79
Response IP 13.33.28.79
Found Yes
Hash 00b8067e70d2e1d1d282b608e12cf3c2df588cef5a199ba5a6eade04bbbab63d
SimHash 5008c940a4a1

Groups

amazonbot
anthropic-ai
applebot-extended
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
imagesiftbot
img2dataset
meta-externalagent
oai-searchbot
omgili
omgilibot
perplexitybot
timpibot
velenpublicwebcrawler
youbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

simplecrawler

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

node/wc-simplecrawler 1.1.13

Rule Path
Disallow /

googlebot

Rule Path
Disallow /photos/gallery.php
Disallow /wikibase/web_db_edit.php
Disallow /wikibase/edit/*

zoombot

Rule Path
Disallow *

femtosearchbot

Rule Path
Disallow *

oai-searchbot

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

amazonbot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

applebot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

http://www.almaden.ibm.com/cs/crawler

Rule Path
Disallow /

http://www.almaden.ibm.com/cs/crawler [st1]

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /about/
Disallow /lib/
Disallow /wikibase/web_db_edit.php
Disallow /wikibase/edit/*
Disallow /database/safety-recommendations/safetyrecs-by-occurrence.php
Disallow /database/types/Douglas-DC-3/database/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*/*/*/*/*/*
Disallow /database/types/Douglas-DC-3/database/*/*/*/*/*/*/*/*/*/*/*/*
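
The Douglas-DC-3 rules above differ only in how many `/*` segments they append. As a rough sketch, assuming the wildcard semantics of RFC 9309 (where `*` matches any run of characters, including `/`, and a pattern matches from the start of the path), the snippet below checks whether the deeper rules block anything the two-segment rule does not. The helper names `robots_pattern_to_regex` and `matches`, and the sample paths, are hypothetical and not part of the scanned file; individual crawlers may interpret wildcards differently.

```python
import re


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumes RFC 9309 semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the pattern to the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matches(pattern: str, path: str) -> bool:
    """Return True if the pattern matches the path starting at its first character."""
    return robots_pattern_to_regex(pattern).match(path) is not None


BASE = "/database/types/Douglas-DC-3/database/"
two_level = BASE + "*/*"          # first Douglas-DC-3 rule in the list
five_level = BASE + "*/*/*/*/*"   # one of the deeper rules

# Illustrative paths only; they are not taken from the site.
shallow_path = BASE + "a"
deep_path = BASE + "a/b/c/d/e/f"

print(matches(two_level, shallow_path))   # no '/' after the first segment
print(matches(two_level, deep_path))      # deeper paths still match
print(matches(five_level, deep_path))
```

Under these assumptions, every path matched by a deeper rule is already matched by the two-segment rule, so the longer rules would be redundant for a crawler that follows RFC 9309.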

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

cliqzbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /
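
The groups listed above can be spot-checked with Python's standard `urllib.robotparser`. The sketch below is a minimal illustration, assuming a hand-written excerpt (`ROBOTS_EXCERPT`) reconstructed from a few of the groups in this report rather than the live file, which could not be fetched at the last scan. Wildcard rules are left out of the excerpt because the standard-library parser does plain prefix matching, and the test URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt based on groups shown in this report (an assumption,
# not the original file). Wildcard paths are intentionally omitted.
ROBOTS_EXCERPT = """\
User-agent: gptbot
Disallow: /

User-agent: bingbot
Crawl-delay: 30

User-agent: *
Disallow: /cgi-bin/
Disallow: /about/
Disallow: /lib/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# A fully blocked crawler cannot fetch any path.
gptbot_allowed = parser.can_fetch("gptbot", "https://avsafetydata.com/database/")

# bingbot has its own group with no rules, so all paths are allowed,
# but the group carries a crawl delay.
bingbot_allowed = parser.can_fetch("bingbot", "https://avsafetydata.com/cgi-bin/test")
bingbot_delay = parser.crawl_delay("bingbot")

# A crawler without its own group falls back to the '*' group.
other_blocked = parser.can_fetch("examplebot", "https://avsafetydata.com/cgi-bin/test")
other_allowed = parser.can_fetch("examplebot", "https://avsafetydata.com/database/")

print(gptbot_allowed, bingbot_allowed, bingbot_delay, other_blocked, other_allowed)
```

Because a crawler with its own group ignores the `*` group, `bingbot` in this excerpt is not bound by the `/cgi-bin/` rule, which matches the "No rules defined. All paths allowed." entries in the report.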