gunsforsalesusa.com
robots.txt

Robots Exclusion Standard data for gunsforsalesusa.com

Resource Scan

Scan Details

Site Domain gunsforsalesusa.com
Base Domain gunsforsalesusa.com
Scan Status Ok
Last Scan 2024-09-10T12:37:21+00:00
Next Scan 2024-10-10T12:37:21+00:00

Last Scan

Scanned 2024-09-10T12:37:21+00:00
URL https://gunsforsalesusa.com/robots.txt
Domain IPs 2a02:4780:1:572:0:1dcd:4f3d:1, 31.170.160.118
Response IP 31.170.160.118
Found Yes
Hash 6d835b494b9988836784c526426ab8a5f0c6d03f6dc0d7085324f0c6bb4f7d58
SimHash ad35625f2092
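
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. Assuming (the report does not say so) that it is a SHA-256 of the fetched robots.txt body, a change-detection check could be sketched as follows. The function name `content_hash` and the sample body are hypothetical and are not the real file content.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    SHA-256 is an assumption based on the 64-character hash length;
    the scanner's actual algorithm is not stated in the report.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body for illustration only.
sample = b"User-agent: *\nDisallow: /checkout/\n"

previous = content_hash(sample)
current = content_hash(sample + b"Disallow: /app/\n")

# A differing digest signals that the file changed between two scans.
print("changed" if previous != current else "unchanged")
```

An exact hash only tells whether the bytes differ at all; the shorter SimHash field presumably serves to estimate how similar two versions are, which this sketch does not attempt.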

Groups

*

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D

yandex
ichiro
moget
naverbot
baiduspider
baidu
sogou
youdao
mj12bot
goodzer
istellabot
gigabot
dotbot
seznambot
ltx71
spiderbot
opensiteexplorer
openlinkprofiler
majestic12
datagnion
sogou
ahrefs
scoutjet
changedetection
istellabot
napovedaseznam
linkfluence
smarter
mj12bot
mojeek
naver
deusu
slack
aihitdata
wotbox
kazbt
mediatoolkit
safedns
aboundex
webmeup-crawler
orangebot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow
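
The two groups above behave in opposite ways: the listed crawlers get `Disallow /`, which blocks every path, while googlebot-image gets an empty Disallow, which blocks nothing. A minimal sketch using Python's standard `urllib.robotparser` illustrates this. The robots.txt text below is a shortened, hypothetical reconstruction from the report, not the file as served, and `SomeOtherBot` is an invented agent name.

```python
from urllib.robotparser import RobotFileParser

# Shortened, hypothetical reconstruction of three groups from the report.
ROBOTS_TXT = """\
User-agent: mj12bot
User-agent: ahrefs
Disallow: /

User-agent: googlebot-image
Disallow:

User-agent: *
Disallow: /checkout/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rp = build_parser(ROBOTS_TXT)
base = "https://gunsforsalesusa.com"

for agent, path in [
    ("mj12bot", "/catalog/"),           # blocked everywhere by "Disallow: /"
    ("Googlebot-Image", "/checkout/"),  # empty Disallow allows everything
    ("SomeOtherBot", "/checkout/"),     # falls through to the * group
    ("SomeOtherBot", "/"),
]:
    print(agent, path, rp.can_fetch(agent, base + path))
```

Note that `urllib.robotparser` matches rules by plain path prefix, so it is suitable for this group-selection example but not for the wildcard rules (`*` and `$`) that appear in the `*` groups.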

*

Rule Path
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /setup/
Disallow /update/
Disallow /var/
Disallow /vendor/
Disallow /composer.json
Disallow /composer.lock
Disallow /CONTRIBUTING.md
Disallow /CONTRIBUTOR_LICENSE_AGREEMENT.html
Disallow /COPYING.txt
Disallow /Gruntfile.js
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /nginx.conf.sample
Disallow /package.json
Disallow /php.ini.sample
Disallow /RELEASE_NOTES.txt
Disallow /*?*product_list_mode=
Disallow /*?*product_list_order=
Disallow /*?*product_list_limit=
Disallow /*?*product_list_dir=
Disallow /*.php$
Disallow /*?SID=
Disallow /*.CVS
Disallow /*.Zip$
Disallow /*.Svn$
Disallow /*.Idea$
Disallow /*.Sql$
Disallow /*.Tgz$
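
Several rules above rely on the `*` wildcard and the `$` end anchor. As a rough sketch of how such patterns are commonly evaluated (a simplified reading of the usual wildcard semantics, not this scanner's implementation), each rule can be translated into a regular expression. The helper names `rule_to_regex` and `is_disallowed` and the sample paths are hypothetical.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(p).match(path) for p in patterns)


# A few rules copied from the group above.
RULES = ["/app/", "/*.php$", "/*?SID=", "/*?*product_list_mode=", "/*.Zip$"]

for path in [
    "/index.php",                               # ends in .php
    "/index.php?x=1",                           # '$' anchor prevents a match
    "/rifles.html?p=2&product_list_mode=list",  # sorted or filtered listing
    "/rifles.html",
    "/dump.zip",                                # lowercase extension
    "/dump.Zip",
]:
    print(path, is_disallowed(path, RULES))
```

Because this matching is case-sensitive, the capitalized patterns `/*.Zip$`, `/*.Sql$`, `/*.Tgz$` and similar would not cover lowercase file extensions such as `.zip` under this reading.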

Other Records

Field Value
sitemap https://gunsforsalesusa.com/sitemap_index.xml

Comments

  • Google Image Crawler Setup
  • Crawlers Setup
  • Directories
  • Files
  • Do not index pages that are sorted or filtered.
  • Do not index session ID
  • CVS, SVN directory and dump files
  • Sitemap

Warnings

  • `https` is not a known field.
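
One plausible cause of this warning, offered only as an assumption since the raw file is not shown here, is a line that starts with a bare URL: a parser that splits each line at the first colon would then read `https` as the field name. The sketch below illustrates that behavior. The set of known fields, the function names, and the sample text are all hypothetical.

```python
# Assumed set of recognized fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def parse_line(line: str):
    """Split a robots.txt line into (field, value), or return None if it has none."""
    line = line.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
    if not line or ":" not in line:
        return None
    field, value = line.split(":", 1)     # split at the FIRST colon only
    return field.strip().lower(), value.strip()


def unknown_fields(text: str) -> list[str]:
    """List field names that are not in KNOWN_FIELDS, in order of appearance."""
    found = []
    for raw in text.splitlines():
        parsed = parse_line(raw)
        if parsed and parsed[0] not in KNOWN_FIELDS:
            found.append(parsed[0])
    return found


# Hypothetical input: the second line is a URL without a "Sitemap:" prefix.
sample_text = """\
# Sitemap
Sitemap: https://example.com/sitemap_index.xml
https://example.com/sitemap.xml
"""

for name in unknown_fields(sample_text):
    print(f"`{name}` is not a known field.")
```

Splitting at the first colon keeps URL values intact for valid lines such as `Sitemap: https://...`, which is why only the line without a field prefix is flagged.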