bygoods.com
robots.txt

Robots Exclusion Standard data for bygoods.com

Resource Scan

Scan Details

Site Domain bygoods.com
Base Domain bygoods.com
Scan Status Ok
Last Scan 2024-10-05T21:06:43+00:00
Next Scan 2024-11-04T21:06:43+00:00

Last Scan

Scanned 2024-10-05T21:06:43+00:00
URL https://bygoods.com/robots.txt
Domain IPs 64.23.161.198
Response IP 64.23.161.198
Found Yes
Hash e0588a2977c5ca56d6035f13be915617f003ee2ad821fd6c99781e62bc63ea47
SimHash 9536f289e6f3

Groups

serpstatbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

paracrawl

Rule Path
Disallow /

scrapy/1.5.0

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

velenpublicwebcrawler (velen.io)

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

semrushbot/2~bl

Rule Path
Disallow /

semrushbot/6~bl

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot/6.1

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /
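
The named groups above each carry a single "Disallow /" rule, which blocks the whole site for that crawler. As a minimal sketch, Python's standard urllib.robotparser can show the effect. The excerpt below is reconstructed from three of the groups in this report, not copied from the live file, and the helper name allowed is hypothetical. The excerpt leaves out the "*" group, so any agent it does not name comes back as allowed here, which is not necessarily true of the real file.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a hand-written excerpt that mirrors three of the groups
# listed in this report. The real file's layout and comments may differ.
EXCERPT = """\
User-agent: semrushbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: dotbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` under the excerpt?"""
    return parser.can_fetch(agent, "https://bygoods.com" + path)


if __name__ == "__main__":
    for agent in ("AhrefsBot/7.0", "SemrushBot", "examplebot"):
        print(agent, allowed(agent, "/any/page.html"))
```

The parser compares only the product token before any "/" in the agent string, case-insensitively, so a version suffix such as "/7.0" does not change which group applies.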

*

Rule Path
Disallow /404/
Disallow /app/
Disallow /cgi-bin/
Disallow /downloader/
Disallow /errors/
Disallow /includes/
Disallow /js/
Disallow /lib/
Disallow /magento/
Disallow /pkginfo/
Disallow /report/
Disallow /scripts/
Disallow /shell/
Disallow /skin/
Disallow /stats/
Disallow /var/
Disallow /sales/
Disallow /rss/
Disallow /order/
Disallow /result/
Disallow /by-orders/
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /checkout/
Disallow /control/
Disallow /contacts/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /poll/
Disallow /review/
Disallow /sendfriend/
Disallow /wishlist/
Disallow /onestepcheckout/
Disallow /shipping-methods/
Disallow /return-policy/
Disallow /about-us/
Disallow /payment-methods/
Disallow /tax/
Disallow /home
Disallow /customer-service/
Disallow /cron.php
Disallow /cron.sh
Disallow /error_log
Disallow /install.php
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /STATUS.txt
Disallow /admin.php
Disallow /login.php
Disallow /*.php$
Disallow /*?SID=
Disallow /*?q=
Disallow /*?dir=*
Disallow /*?limit=*
Disallow /*?q=
Disallow /*?limit=*
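
Several rules in the "*" group use the pattern characters defined in RFC 9309: "*" matches any run of characters and a trailing "$" anchors the rule to the end of the URL. Rules without either character are plain prefixes, so "/home" also covers "/homeware". The sketch below translates a rule into a regular expression to illustrate this. The function names and sample paths are hypothetical, RULES is only a subset of the list above, and Allow rules, longest-match precedence and percent-encoding are ignored because this group contains only Disallow rules. Python's urllib.robotparser compares paths as literal prefixes, so it would not interpret these patterns.

```python
import re


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a robots.txt path rule into an anchored-at-start regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and let each '*' match any run of characters.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    # A trailing '$' in the rule means the match must reach the end of the URL.
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules: list) -> bool:
    """True if any rule matches the path (with its query string) from the start."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# Subset of the "*" group rules from this report.
RULES = ["/checkout/", "/home", "/*.php$", "/*?SID=", "/*?limit=*"]

if __name__ == "__main__":
    # Hypothetical sample paths, not taken from the scanned site.
    for sample in ("/checkout/cart", "/homeware", "/page.php",
                   "/page.php?x=1", "/shoes?limit=24", "/shoes"):
        print(sample, is_disallowed(sample, RULES))
```

Under this subset, "/page.php?x=1" is not matched by "/*.php$" because the "$" requires the URL to end in ".php". The full list may still block a given PHP URL through another rule, such as "/index.php/" or "/cron.php".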

Comments

  • Google Image Crawler Setup
  • Crawlers Setup
  • Directories
  • Paths (clean URLs)
  • Files
  • Paths (no clean URLs)