domy.pl
robots.txt

Robots Exclusion Standard data for domy.pl

Resource Scan

Scan Details

Site Domain domy.pl
Base Domain domy.pl
Scan Status Ok
Last Scan2024-11-14T13:58:16+00:00
Next Scan 2024-11-21T13:58:16+00:00

Last Scan

Scanned2024-11-14T13:58:16+00:00
URL https://domy.pl/robots.txt
Domain IPs 104.26.14.116, 104.26.15.116, 172.67.73.131
Response IP 104.26.14.116
Found Yes
Hash e907ad8d0c3d6b45889f738ae91654fe9435314c34f275c80414785dbd5f2f67
SimHash 6c57c896e573

Groups

sistrix

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

seokicks robot

Rule Path
Disallow /

*

Rule Path
Disallow /podbicie/
Disallow /en/
Disallow /de/
Disallow /es/
Disallow /fr/
Disallow /ru/
Disallow /dlaagencji
Disallow /dladeweloperow
Disallow /regulamin
Disallow /cennik
Disallow /wspolpraca/
Disallow /podbicia
Disallow /ajaxRequirementContactSend
Disallow /requirementAdd/
Disallow /pl/requirement
Disallow /zapotrzebowanie/
Disallow /*/drukuj

Other Records

Field Value
sitemap http://domy.pl/sitemaps_pl/sitemap.xml

Comments

  • http://www.80legs.com/webcrawler.html
  • http://crawler.sistrix.net/
  • http://www.trendiction.de/bot
  • http://www.seokicks.de/robot.html
  • all spiders - keep at bottom

Warnings

  • 2 invalid lines.