apia.pl
robots.txt

Robots Exclusion Standard data for apia.pl

Resource Scan

Scan Details

Site Domain apia.pl
Base Domain apia.pl
Scan Status Ok
Last Scan 2024-09-21T04:22:14+00:00
Next Scan 2024-10-21T04:22:14+00:00

Last Scan

Scanned 2024-09-21T04:22:14+00:00
URL https://apia.pl/robots.txt
Redirect https://www.apia.pl/robots.txt
Redirect Domain www.apia.pl
Redirect Base apia.pl
Domain IPs 91.198.137.196
Redirect IPs 91.198.137.196
Response IP 91.198.137.196
Found Yes
Hash 440c759e06d91c98c71e485ec237f182dd05212a2b9b83cd05888f8fe6689bb5
SimHash 6306936b16b1

Groups

*

Rule Path
Allow /*?p=
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /*?Dir=*
Disallow /*?Dir=desc
Disallow /*?Dir=asc
Disallow /*?Limit=all
Disallow /*Order%3D*
Disallow /*?Mode=*
Disallow /*?s_style=*
Disallow /*?season=*
Disallow /*?s_type=*
Disallow /*?color=*
Disallow /*?q=*
Disallow /*?s_size=*
Disallow /*?gender=*
Disallow /*?kolor=*
Disallow /*?manufacturer=*
Disallow /*?nowosc=*
Disallow /*?occasion=*
Disallow /*?okazja=*
Disallow /*?plec=*
Disallow /*?price=*
Disallow /*?rozmiar_eu=*
Disallow /*?season=*
Disallow /*?style=*
Disallow /*?typ=*
Disallow /*?typ_obcasa=*
Disallow /*?wierzch=*
Disallow /*?wyprzedaz=*
Disallow /*?wysokosc_obcasa=*
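
How the Allow and Disallow rules in the * group interact can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's or any crawler's actual implementation. It assumes the matching semantics described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The function names, the shortened rule list, and the sample paths (such as /buty) are hypothetical and chosen only for illustration.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


# A subset of the rules listed for the * group.
RULES = [
    ("allow", "/*?p="),
    ("disallow", "/index.php/"),
    ("disallow", "/*?"),
    ("disallow", "/checkout/"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?color=*"),
]

if __name__ == "__main__":
    # Hypothetical sample paths, not taken from the site.
    for sample in ["/buty", "/buty?p=2", "/buty?color=red", "/checkout/cart"]:
        print(sample, is_allowed(RULES, sample))
```

Under these assumptions, a paginated URL such as /buty?p=2 stays crawlable because `/*?p=` is longer than the general `/*?` rule, while filtered URLs such as /buty?color=red remain blocked.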

zumbot
zmeu
zend_http_client
youdaobot
yodaobot
yisouspider
yamanalab-robot
xpymep\.exe
www\.integromedb\.org
wotbot
websitetheweb\.com
webindetail\.com
webcapture
wbsearchbot
visaduhoc\.info
turnitinbot
the\ incutio\ xml-rpc\ php\ library
surveybot
speedy
sosospider
solomonobot
sogou
socialsearcher
snoopy
sitebot
sistrix
shopwiki
seoengworldbot
semrushbot
searchmetrics
screenerbot
rojerbot
riddler
queryseekerspider
purebot
proximic
procogseobot
peoplepal
pagesinventory
openwebindex
ocelli
nextgensearchbot
netseer
netestate\ ne\ crawler
netcraftsurveyagent
ncbot
msie\ or\ firefox\ mutant
magpie\-crawler
ltbot
lipperhey
linkdex\.com
lindex\.com
libwww-perl
jikespider
jakarta\ commons-httpclient
ip\-web\-crawler\.com
indy\ library
gigabot
ftrf\:\ friendly
ezooms
ezinearticleslinkscanner
exabot
easouspider
dow\ jones\ searchbot
dotnetdotcom
dotbot
discoverybot
dinoping
dataprovider\.com
curious
compspybot
comodo-certificates-spider
comodo
clipish
charlotte
catchbot
butterfly
brandwatch\.net
baiduspider
backlinkcrawler
awcheckbot
aihitbot
ahrefsbot
add\ catalog
accelobot
aboundex
seo-crawling
blexbot
mj12bot
crawler
domaincrawler
spbot
scrapy
seokicks-robot
cliqzbot
linkdexbot
ucrawler

Rule Path
Disallow /
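
The effect of the named-agent group can be sketched as follows. This is a simplified illustration that assumes a crawler compares its product token to the listed user-agent values case-insensitively and as plain strings (not as regular expressions), falling back to the * group when nothing matches. Real crawlers differ in how strictly they match. The names `BLOCKED_AGENTS`, `select_group`, and `may_fetch_anything`, and the token "examplebot", are hypothetical, and only a handful of the listed agents are included.

```python
# A few of the user-agent values from the named group, stored as listed.
BLOCKED_AGENTS = {"semrushbot", "ahrefsbot", "mj12bot", "dotbot", "baiduspider"}


def select_group(product_token: str) -> str:
    """Return which group applies to a crawler's product token.

    Assumption: exact, case-insensitive comparison of the token against
    the listed values; unmatched crawlers fall back to the "*" group.
    """
    if product_token.lower() in BLOCKED_AGENTS:
        return "named"
    return "*"


def may_fetch_anything(product_token: str) -> bool:
    """The named group carries a single rule, Disallow /, so nothing is allowed."""
    return select_group(product_token) != "named"


if __name__ == "__main__":
    for token in ["AhrefsBot", "examplebot"]:
        print(token, select_group(token), may_fetch_anything(token))
```

Under the plain-string assumption, entries written with backslashes (for example `linkdex\.com`) would be compared literally, backslash included, so they would not match a crawler whose token lacks the backslash.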

Other Records

Field Value
sitemap https://www.apia.pl/pub/sitemap/sitemap_pl.xml
sitemap https://www.apia.pl/sitemap/category_sitemap.xml
sitemap https://www.apia.pl/sitemap/sitemap_pl.xml
sitemap https://www.apia.pl/sitemap/warmer_product_sitemap.xml

Comments

  • Crawlers Setup
  • Directories
  • subcategories that are sorted or filtered.
  • Disallow: /*?p=*
  • Bots

Warnings

  • 18 invalid lines.