gryc24.pl
robots.txt

Robots Exclusion Standard data for gryc24.pl

Resource Scan

Scan Details

Site Domain gryc24.pl
Base Domain gryc24.pl
Scan Status Ok
Last Scan2024-10-05T21:05:57+00:00
Next Scan 2024-11-04T21:05:57+00:00

Last Scan

Scanned2024-10-05T21:05:57+00:00
URL https://gryc24.pl/robots.txt
Domain IPs 95.216.101.140
Response IP 95.216.101.140
Found Yes
Hash a49d744574fc85713205bde0217c6b27f63b1ddeda3d658cb3e81b103039ff83
SimHash 5913c80043da

Groups

*

Rule Path
Disallow /register
Disallow /login
Disallow /sort%3D
Disallow /cPath
Disallow /add_wishlist
Disallow /contact_us
Disallow /search
Disallow /zapytaj
Disallow */compare/
Disallow */cart/
Disallow /pdf
Disallow *c%3D
Disallow /img
Allow *webp
Allow *jpg
Allow *png

seznambot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

lcc

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gryc24.pl/sitemap.xml

Comments

  • Disallow: /pdf_datasheet
  • Disallow: /pdf/catalog
  • Block Seznam Bot
  • Block BLEX Bot
  • Block MJ12 Bot
  • Block LCC