americanwirenews.com
robots.txt

Robots Exclusion Standard data for americanwirenews.com

Resource Scan

Scan Details

Site Domain americanwirenews.com
Base Domain americanwirenews.com
Scan Status Ok
Last Scan2024-06-07T15:39:57+00:00
Next Scan 2024-06-14T15:39:57+00:00

Last Scan

Scanned2024-06-07T15:39:57+00:00
URL https://americanwirenews.com/robots.txt
Domain IPs 104.21.73.26, 172.67.137.248, 2606:4700:3031::6815:491a, 2606:4700:3032::ac43:89f8
Response IP 104.21.73.26
Found Yes
Hash c2172f2ccf0f5cb5fd0c4aeed14a01543cae4a4c180ca03cee7283fb127414a5
SimHash 6c115991e8a9

Groups

*

Rule Path
Disallow /?
Disallow /?*
Disallow /?author=*
Disallow /?s=
Disallow /*?count=*
Disallow /*?filter=
Disallow /*?p=*
Disallow /*?preview=*
Disallow /*?s=*
Disallow /*%26count%3D*
Disallow /*%26filter%3D
Disallow /*%26p%3D*
Disallow /*%26preview%3D*
Disallow /*%26s%3D*
Disallow /*add_to_wishlist%3D*
Disallow /*add-to-cart%3D*
Disallow /*cart/*
Disallow /*checkout/*
Disallow /*my-account/*
Disallow /*myaccount/*
Disallow /*orderby%3Ddate
Disallow /*orderby%3Ddesc
Disallow /*orderby%3Dpopularity
Disallow /*orderby%3Dprice
Disallow /*orderby%3Dprice-desc
Disallow /*orderby%3Drating
Disallow /*orderby%3Dtitle
Disallow /*paged%3D%26count%3D*
Disallow /*replytocom%3D*
Disallow /*wc-ajax%3Dadd_to_cart
Disallow /*wc-ajax%3Dremove_from_cart
Disallow /*wp-comments*
Disallow /*wp-feed*
Disallow /*wp-trackback*
Disallow /cart/*
Disallow /checkout/*
Disallow /members/
Disallow /my-account/*
Disallow /search/
Disallow /wp-admin/
Disallow /wp-content/uploads/*
Disallow /wp-login.php
Allow /wp-admin/admin-ajax.php
Allow /*/plugins/*

Other Records

Field Value
crawl-delay 30

parler
parlerstaging
bingbot
msnbot
msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

twitterbot
facebookexternalhit
facebot
googlebot

Rule Path
Allow *

ahrefsbot
ahrefssiteaudit
baiduspider
baiduspider-ads
baiduspider-cpro
baiduspider-favo
baiduspider-image
baiduspider-news
baiduspider-video
coccoc
dataforseobot
genieo
hoaxybot
laserlikebot
mail.ru
mj12bot
petalbot
qwantify
rogerbot
screaming frog seo spider
semrushbot
semrushbot-ba
semrushbot-bm
semrushbot-ct
semrushbot-sa
semrushbot-si
semrushbot-swa
seoscanners.net
seznambot
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider
sogou spider2
sogou web spider
spbot
splitsignalbot
storygizebot
yandex
yandexbot
yandeximages
yandexmobilebot

Rule Path
Disallow /

*

Rule Path
Disallow /*?p=*
Disallow /*%26p%3D*
Disallow /*?s=*
Disallow /*%26s%3D*
Disallow /?author=*
Disallow /*wp-comments*
Disallow /*wp-trackback*
Disallow /*wp-feed*
Disallow /*replytocom%3D*
Disallow /*?preview=*
Disallow /*%26preview%3D*
Disallow /*add-to-cart%3D*
Disallow /*add_to_wishlist%3D*
Disallow /*cart/*
Disallow /*checkout/*
Disallow /*my-account/*
Disallow /*myaccount/*
Allow /*/plugins/*

ahrefsbot

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

Other Records

Field Value
sitemap https://americanwirenews.com/sitemap_index.xml

Comments

  • Start Robots Customizations
  • Stop bots from crawling junk URLs
  • End Robots Customizations