cebule.pl
robots.txt

Robots Exclusion Standard data for cebule.pl

Resource Scan

Scan Details

Site Domain cebule.pl
Base Domain cebule.pl
Scan Status Ok
Last Scan 2024-09-30T01:44:03+00:00
Next Scan 2024-10-30T01:44:03+00:00

Last Scan

Scanned 2024-09-30T01:44:03+00:00
URL https://cebule.pl/robots.txt
Redirect https://www.cebule.pl/robots.txt
Redirect Domain www.cebule.pl
Redirect Base cebule.pl
Domain IPs 5.149.163.203
Redirect IPs 5.149.163.203
Response IP 5.149.163.203
Found Yes
Hash dc2eb06a405c2969595c9e618673211a512e0ac365060a915648c842dc8c411a
SimHash 529cf6508931
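
A recorded hash like the one above can be used to tell whether robots.txt has changed between scans. The sketch below is illustrative only: the 64-character hex value is consistent with SHA-256, but the scan does not name the algorithm or say which bytes were hashed, so both are assumptions here. The names `body_digest` and `has_changed` are hypothetical helpers, not part of any scanner.

```python
import hashlib

# Hash value copied from the scan details above.
RECORDED_HASH = "dc2eb06a405c2969595c9e618673211a512e0ac365060a915648c842dc8c411a"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the recorded hash."""
    return body_digest(body) != recorded.lower()
```

If the digest of a fresh fetch never matches the recorded value, the assumption about the algorithm or the hashed bytes is probably wrong, rather than the file having changed.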

Groups

*

Rule Path
Disallow /*?rec=*
Disallow /*%26rec%3D*

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

fyberspider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot/2.0

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

*

Rule Path
Disallow /search.php
Disallow /noproduct.php
Disallow /signin.php
Allow /
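
The two `*` groups listed above apply to any crawler that has no group of its own, and under RFC 9309 their rules are combined. The following is a minimal sketch of how those combined rules could be evaluated, assuming the RFC 9309 conventions: `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The names `pattern_to_regex` and `is_allowed` are hypothetical, and percent-encoding normalization is left out for brevity.

```python
import re

# Rules from both '*' groups in the scan, combined into one list.
RULES = [
    ("disallow", "/*?rec=*"),
    ("disallow", "/*%26rec%3D*"),
    ("disallow", "/search.php"),
    ("disallow", "/noproduct.php"),
    ("disallow", "/signin.php"),
    ("allow", "/"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Decide whether a path (including any query string) may be fetched.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

With these rules, `is_allowed("/search.php")` is false because `/search.php` is a longer match than `/`, while an ordinary page path that only matches `Allow /` is permitted. Crawlers named in their own group, such as ahrefsbot or mj12bot, would use that group's `Disallow /` instead of these rules.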

Other Records

Field Value
sitemap https://www.cebule.pl/sitemap.xml.gz

Comments

  • Pages with rec parameter - IAI Recommendation System
  • Automatically banned scanners and crawlers section
  • Section end

Warnings

  • 3 invalid lines.