airflow.com
robots.txt

Robots Exclusion Standard data for airflow.com

Resource Scan

Scan Details

Site Domain airflow.com
Base Domain airflow.com
Scan Status Ok
Last Scan2024-09-24T03:41:55+00:00
Next Scan 2024-10-24T03:41:55+00:00

Last Scan

Scanned2024-09-24T03:41:55+00:00
URL https://airflow.com/robots.txt
Redirect https://www.airflow.com/robots.txt
Redirect Domain www.airflow.com
Redirect Base airflow.com
Domain IPs 20.13.109.145
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash b9bd7b6bb4666d92ed35cedbf382a8ca78fbadc15430e0f76a93095f42a6b02a
SimHash 2614dd11fe78

Groups

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

googlebot

Rule Path
Disallow /cms/cms.jsp?menu_id=30152
Disallow /Delivery-Address
Disallow /Delivery-Charges
Disallow /cms/cms.jsp?menu_id=30096&choice=YES
Disallow /Credit-Checks
Disallow /Checkout-Complete
Disallow /Login
Disallow /ForgottenPassword
Disallow /Basket
Disallow */cms.jsp*
Disallow /product?prodref=*

googlebot-image

Rule Path
Disallow /cms/cms.jsp?menu_id=30152
Disallow /Delivery-Address
Disallow /Delivery-Charges
Disallow /cms/cms.jsp?menu_id=30096&choice=YES
Disallow /Credit-Checks
Disallow /Checkout-Complete
Disallow /Login
Disallow /ForgottenPassword
Disallow /Basket
Disallow */cms.jsp*
Disallow /product?prodref=*

bingbot

Rule Path
Disallow /cms/cms.jsp?menu_id=30152
Disallow /Delivery-Address
Disallow /Delivery-Charges
Disallow /cms/cms.jsp?menu_id=30096&choice=YES
Disallow /Credit-Checks
Disallow /Checkout-Complete
Disallow /Login
Disallow /ForgottenPassword
Disallow /Basket
Disallow */cms.jsp*
Disallow /product?prodref=*

Other Records

Field Value
crawl-delay 1

msnbot

Rule Path
Disallow /cms/cms.jsp?menu_id=30152
Disallow /Delivery-Address
Disallow /Delivery-Charges
Disallow /cms/cms.jsp?menu_id=30096&choice=YES
Disallow /Credit-Checks
Disallow /Checkout-Complete
Disallow /Login
Disallow /ForgottenPassword
Disallow /Basket
Disallow */cms.jsp*
Disallow /product?prodref=*

blexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

megaindex.com

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

screaming frog seo spider

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

petalbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

seekportbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

baiduspider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Comments

  • BLEXBOT http://webmeup-crawler.com/ - SEO
  • https://ahrefs.com/robot - SEO
  • Majestic http://mj12bot.com/ - SEO
  • http://www.semrush.com/bot.html - SEO
  • https://megaindex.com/crawler - SEO
  • Has a maximum crawl delay of 5
  • https://aspiegel.com/petalbot - Petal Search (Huawei Assistant and AI Search services)
  • Doesn't respect crawl-delay
  • http://yandex.com/bots - Yandex Search Engine (Russian)
  • Doesn't respect crawl-delay
  • --
  • SEO Crawlers
  • BLEXBOT http://webmeup-crawler.com/ - SEO
  • https://ahrefs.com/robot - SEO
  • Majestic http://mj12bot.com/ - SEO
  • http://www.semrush.com/bot.html - SEO
  • https://megaindex.com/crawler - SEO
  • Has a maximum crawl delay of 5
  • https://opensiteexplorer.org/dotbot - SEO
  • https://www.screamingfrog.co.uk/seo-spider - SEO
  • Doesn't respect crawl-delay
  • Agressive Search Engines
  • http://www.bing.com/bingbot.htm - Bing Search
  • https://aspiegel.com/petalbot - Petal Search (Huawei Assistant and AI Search services)
  • Doesn't respect crawl-delay
  • http://yandex.com/bots - Yandex Search Engine (Russian)
  • Doesn't respect crawl-delay
  • https://developer.amazon.com/support/amazonbot - Alexa indexing
  • Doesn't respect crawl-delay
  • https://bot.seekport.com - German Search Engine
  • bytedance spiders
  • bytedance spiders
  • bytedance spiders
  • bytedance spiders
  • bytedance spiders
  • chatgpt bot

Warnings

  • 2 invalid lines.