gemperles.com
robots.txt

Robots Exclusion Standard data for gemperles.com

Resource Scan

Scan Details

Site Domain gemperles.com
Base Domain gemperles.com
Scan Status Ok
Last Scan2025-12-25T10:55:34+00:00
Next Scan 2026-01-24T10:55:34+00:00

Last Scan

Scanned2025-12-25T10:55:34+00:00
URL https://www.gemperles.com/robots.txt
Domain IPs 172.66.40.153, 172.66.43.103, 2606:4700:3108::ac42:2899, 2606:4700:3108::ac42:2b67
Response IP 172.66.43.103
Found Yes
Hash ca2e63ccf3dae5756b87224aa8d72d624403791bdb7bb3f79390ccbcf7929fc9
SimHash 490e4b726aad

Groups

mj12bot
semrushbot
yandex
yandexbot
zoominfobot
um-ln
mtrobot
safednsbot
bytespider
bitlybot
barkrowle
coccocbot-image
petalbot
dotbot
blexbot
aspiegelbot
baiduspider
trovitbot
mail.ru_bot
seznambot
yisouspider
zh-cn
slackbot
slackbot-linkexpanding
claudebot
amazonbot
fidget-spinner-bot
pinterestbot
vebot
awariobot
geedoproductsearch
skypeuripreview
gptbot
chatgpt-user
meta-externalagent
facebookexternalhit
chatgpt-user

Rule Path
Disallow /
Disallow /index.php
Disallow /checkout
Disallow /app
Disallow /lib
Disallow /*.php*
Disallow /pkginfo
Disallow /report
Disallow /var
Disallow /catalog
Disallow /customer
Disallow /sendfriend
Disallow /review
Disallow /*SID*
Disallow /catalogsearch
Disallow /*price*
Disallow /*product_list_order*
Disallow /blog/tag
Disallow /*?%3E%24*
Disallow /*?*
Disallow /*?gem*
Allow /*?p*

*

Rule Path
Disallow /?dir
Disallow /?dir=desc
Disallow /?dir=asc
Disallow /?limit=all
Disallow /?mode*
Disallow /?color*
Disallow /*?color*
Disallow /*%26color*
Disallow /?tax_invoice*
Disallow /*?tax_invoice*
Disallow /*%26tax_invoice*
Disallow /?vendor_region*
Disallow /*?vendor_region*
Disallow /*%26vendor_region*
Disallow /?brand*
Disallow /*?brand*
Disallow /*%26brand*
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /tag/
Disallow /review/
Disallow /*?
Disallow /api*
Disallow /graphql*
Disallow /index.php
Disallow /*/*/*/switch
Disallow /*/*/*/redirect
Disallow /*/elasticsuite
Disallow /*/catalogsearch
Disallow /*/*/*?srsltid*
Disallow /*/*/*/*/___store/*/___from_store
Disallow /*/*/*/switch/?*
Disallow /*/manual-book-download
Disallow /*/search/ajax/suggest/?*
Disallow /*/*/*/*?srsltid*

*

Rule Path
Allow /*?brand=*
Allow /*/*?brand=*
Disallow /api
Disallow /graphql*
Disallow /*?
Disallow /catalog
Disallow /catalogsearch
Disallow /wishlist
Disallow /admin
Disallow /checkout
Disallow /onestepcheckout
Disallow /customer
Disallow /review
Disallow /sendfriend
Disallow /enable-cookies
Disallow /LICENSE.txt
Disallow /LICENSE.html
Disallow /skin
Disallow /js
Disallow /directory
Disallow /404
Disallow /*?dir*
Disallow /*?dir*
Disallow /*?limit*
Disallow /*/*?mode*
Disallow /*/*?dir*
Disallow /*/*?limit*
Disallow /*/*?mode*
Disallow /catalogsearch/result*

*

Rule Path
Disallow /*?utm*
Disallow /*?*&utm*
Disallow /*/*?utm*
Disallow /*/*?*&utm*

*

Rule Path
Disallow /*.csv$
Disallow /*.xls$
Disallow /*.json$
Disallow /*.asa$
Disallow /*.asax$
Disallow /*.ascx$
Disallow /*.axd$
Disallow /*.backup$
Disallow /*.bak$
Disallow /*.bat$
Disallow /*.cdx$
Disallow /*.cer$
Disallow /*.cfg$
Disallow /*.cmd$
Disallow /*.com$
Disallow /*.config$
Disallow /*.conf$
Disallow /*.cs$
Disallow /*.csproj$
Disallow /*.csr$
Disallow /*.dat$
Disallow /*.db$
Disallow /*.dbf$
Disallow /*.dll$
Disallow /*.dos$
Disallow /*.htr$
Disallow /*.htw$
Disallow /*.ida$
Disallow /*.idc$
Disallow /*.idq$
Disallow /*.inc$
Disallow /*.ini$
Disallow /*.key$
Disallow /*.licx$
Disallow /*.lnk$
Disallow /*.log$
Disallow /*.mdb$
Disallow /*.old$
Disallow /*.pass$
Disallow /*.pdb$
Disallow /*.pol$
Disallow /*.printer$
Disallow /*.pwd$
Disallow /*.rdb$
Disallow /*.resources$
Disallow /*.resx$
Disallow /*.sql$
Disallow /*.swp$
Disallow /*.sys$
Disallow /*.vb$
Disallow /*.vbs$
Disallow /*.vbproj$
Disallow /*.vsdisco$
Disallow /*.webinfo$
Disallow /*.xsd$
Disallow /*.xsx$
Disallow /*.md$
Disallow /*.bzr$
Disallow /*.MD$
Disallow /*._darcs$
Disallow /*.git$
Disallow /*.ssh$
Disallow /*.svn$
Disallow /*.class$

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

instagram

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

Other Records

Field Value
sitemap Sitemap: https://www.gemperles.com/sitemap.xml

Comments

  • Block Bad Bot
  • Gemperles rules
  • Allow Pagination
  • Rules
  • Filters
  • New Filters
  • Blocking CMS directories
  • Blocking duplicate content
  • Default Rules
  • Blocking UTM
  • Blocking extension. Hardening security purpose for defend from unknow hack tools
  • Crawl Delay For Decrease High Load

Warnings

  • 1 invalid line.