asurascans.us
robots.txt

Robots Exclusion Standard data for asurascans.us

Resource Scan

Scan Details

Site Domain asurascans.us
Base Domain asurascans.us
Scan Status Ok
Last Scan 2024-09-23T05:30:49+00:00
Next Scan 2024-09-30T05:30:49+00:00

Last Scan

Scanned 2024-09-23T05:30:49+00:00
URL https://asurascans.us/robots.txt
Domain IPs 104.21.1.246, 172.67.128.104, 2606:4700:3033::6815:1f6, 2606:4700:3033::ac43:8068
Response IP 172.67.128.104
Found Yes
Hash bf1c64d1cfcd2e41f78504c98434d17cebe3b172f6c2b117f15ca8cce355de43
SimHash 3336d391c57b

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Disallow /wp-content/uploads/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /wp-content/uploads/

applebot

Rule Path
Allow /

yandex

Rule Path
Allow /

yandeximages

Rule Path
Disallow /wp-content/uploads/

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

qwantify

Rule Path
Allow /

baiduspider

Rule Path
Allow /

sogou web spider

Rule Path
Allow /

sogou inst spider

Rule Path
Allow /

sosospider+

Rule Path
Allow /

youdaobot

Rule Path
Allow /

naverbot

Rule Path
Disallow /

seznambot

Rule Path
Allow /

facebook

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

facebookscraper

Rule Path
Allow /

facebot

Rule Path
Allow /

instagrambot

Rule Path
Allow /

whatsapp bot

Rule Path
Allow /

telegrambot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

pinterest bot

Rule Path
Allow /

discordbot

Rule Path
Allow /

*

Rule Path
Disallow /*.webp$

*

Rule Path
Disallow /*.jpg$

*

Rule Path
Disallow /*.png$

*

Rule Path
Disallow /*.gif$

*

Rule Path
Disallow /*.pdf$

*

Rule Path
Disallow /*.docx$

*

Rule Path
Disallow /*.html$

*

Rule Path
Disallow /*.php$
Disallow /wp-json/
Disallow /?rest_route=
Disallow /search/
Disallow /?s=

Other Records

Field Value
sitemap https://asurascans.us/sitemap_index.xml

Comments

  • Block Ahrefs Crawler
  • Block Semrush Crawler
  • Block Moz Crawler
  • Block Majestic Crawler
  • Block Chatgpt
  • Rankmath Sitemap Link
  • Allow Google Bot
  • Block Google Images Bot
  • Allow Google Media Partners Bot
  • Allow Google AdsBot Bot
  • Allow Google Mobile Bot
  • Allow Bing Bot
  • Allow MSN Bot
  • Allow MSNBot Media Bot
  • Allow Apple Bot
  • Allow Yandex Bot
  • Block Yandex Images Bot
  • Allow Yahoo Search (Slurp bot)
  • Allow DuckDuckGo Bot
  • Allow Qwant Bot
  • Allow Baidu/Sogou/Soso/Youdao Bot
  • Block Naver Bot
  • Allow Seznam Bot
  • Allow Facebook Bot
  • Allow Instagram Bot
  • Allow Whatsapp Bot
  • Allow Telegram Bot
  • Allow Twitter Bot
  • Allow Linkedin Bot
  • Allow Pinterest Bot
  • Allow Discord Bot
  • Block Webp Images
  • Block Jpg Images
  • Block Png Images
  • Block Gif Images
  • Block PDF Files
  • Block DOCX Files
  • Block Html Files
  • Block Php Files
  • Prevent Crawling of WordPress JSON API Endpoints
  • Block Search URLs /search/ and /?s=