apia.com
robots.txt

Robots Exclusion Standard data for apia.com

Resource Scan

Scan Details

Site Domain apia.com
Base Domain apia.com
Scan Status Ok
Last Scan2024-09-28T07:49:16+00:00
Next Scan 2024-10-28T07:49:16+00:00

Last Scan

Scanned2024-09-28T07:49:16+00:00
URL https://apia.com/robots.txt
Redirect https://www.apia.com/robots.txt
Redirect Domain www.apia.com
Redirect Base apia.com
Domain IPs 91.198.137.196
Redirect IPs 91.198.137.196
Response IP 91.198.137.196
Found Yes
Hash 08a47b0e7199e5d7dba4d9d3353967594f41926091eff657a31efb226db22fa3
SimHash 6b36936b06b1

Groups

*

Rule Path
Allow /*?p=
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /*?Dir=*
Disallow /*?Dir=desc
Disallow /*?Dir=asc
Disallow /*?Limit=all
Disallow /*Order%3D*
Disallow /*?Mode=*
Disallow /*?s_style=*
Disallow /*?season=*
Disallow /*?s_type=*
Disallow /*?color=*
Disallow /*?q=*
Disallow /*?s_size=*
Disallow /*?gender=*
Disallow /*?kolor=*
Disallow /*?manufacturer=*
Disallow /*?nowosc=*
Disallow /*?occasion=*
Disallow /*?okazja=*
Disallow /*?plec=*
Disallow /*?price=*
Disallow /*?rozmiar_eu=*
Disallow /*?season=*
Disallow /*?style=*
Disallow /*?typ=*
Disallow /*?typ_obcasa=*
Disallow /*?wierzch=*
Disallow /*?wyprzedaz=*
Disallow /*?wysokosc_obcasa=*

zumbot
zmeu
zend_http_client
youdaobot
yodaobot
yisouspider
yamanalab-robot
xpymep\.exe
www\.integromedb\.org
wotbot
websitetheweb\.com
webindetail\.com
webcapture
wbsearchbot
visaduhoc\.info
turnitinbot
the\ incutio\ xml-rpc\ php\ library
surveybot
speedy
sosospider
solomonobot
sogou
socialsearcher
snoopy
sitebot
sistrix
shopwiki
seoengworldbot
semrushbot
searchmetrics
screenerbot
rojerbot
riddler
queryseekerspider
purebot
proximic
procogseobot
peoplepal
pagesinventory
openwebindex
ocelli
nextgensearchbot
netseer
netestate\ ne\ crawler
netcraftsurveyagent
ncbot
msie\ or\ firefox\ mutant
magpie\-crawler
ltbot
lipperhey
linkdex\.com
lindex\.com
libwww-perl
jikespider
jakarta\ commons-httpclient
ip\-web\-crawler\.com
indy\ library
gigabot
ftrf\:\ friendly
ezooms
ezinearticleslinkscanner
exabot
easouspider
dow\ jones\ searchbot
dotnetdotcom
dotbot
discoverybot
dinoping
dataprovider\.com
curious
compspybot
comodo-certificates-spider
comodo
clipish
charlotte
catchbot
butterfly
brandwatch\.net
baiduspider
backlinkcrawler
awcheckbot
aihitbot
ahrefsbot
add\ catalog
accelobot
aboundex
seo-crawling
blexbot
mj12bot
crawler
domaincrawler
spbot
scrapy
seokicks-robot
cliqzbot
linkdexbot
ucrawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.apia.com/pub/sitemap/sitemap_en.xml

Comments

  • Crawlers Setup
  • Directories
  • subcategories that are sorted or filtered.
  • Disallow: /*?p=*
  • Bots

Warnings

  • 18 invalid lines.