agrofoto.pl
robots.txt

Robots Exclusion Standard data for agrofoto.pl

Resource Scan

Scan Details

Site Domain agrofoto.pl
Base Domain agrofoto.pl
Scan Status Ok
Last Scan 2026-01-20T03:28:28+00:00
Next Scan 2026-02-19T03:28:28+00:00
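
The two timestamps above imply the rescan interval. A minimal sketch of that arithmetic, using only the values shown in this report (the variable names are illustrative, and the report itself does not state that the interval is fixed):

```python
from datetime import datetime

# Timestamps copied verbatim from the Scan Details fields above.
last_scan = datetime.fromisoformat("2026-01-20T03:28:28+00:00")
next_scan = datetime.fromisoformat("2026-02-19T03:28:28+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```

For this record the gap is exactly 30 days; whether the scanner always uses that interval is an assumption.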

Last Scan

Scanned 2026-01-20T03:28:28+00:00
URL https://agrofoto.pl/robots.txt
Domain IPs 136.243.115.101
Response IP 136.243.115.101
Found Yes
Hash 57c791f473bdc1e07157e4f8ce56a55ad99d0ce59e8f5db3623d7fdaba826625
SimHash db423da7c98c

Groups

*

Rule Path
Disallow /forum/startTopic/
Disallow /forum/discover/unread/
Disallow /forum/markallread/
Disallow /forum/staff/
Disallow /forum/cookie/
Disallow /forum/online/
Disallow /forum/discover/
Disallow /forum/leaderboard/
Disallow /forum/search/
Disallow /forum/*?advancedSearchForm=
Disallow /forum/register/
Disallow /forum/lostpassword/
Disallow /forum/login/
Disallow /forum/*?sortby=
Disallow /forum/*?filter=
Disallow /forum/*?tab=
Disallow /forum/*?do=
Disallow /forum/*ref%3D
Disallow /forum/*?forumId*
Disallow /forum/*?&controller=embed
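
Several of the paths above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns could be evaluated, assuming the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a literal prefix). The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and are not part of any crawler's API. Because this group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/forum/startTopic/",
    "/forum/discover/unread/",
    "/forum/markallread/",
    "/forum/staff/",
    "/forum/cookie/",
    "/forum/online/",
    "/forum/discover/",
    "/forum/leaderboard/",
    "/forum/search/",
    "/forum/*?advancedSearchForm=",
    "/forum/register/",
    "/forum/lostpassword/",
    "/forum/login/",
    "/forum/*?sortby=",
    "/forum/*?filter=",
    "/forum/*?tab=",
    "/forum/*?do=",
    "/forum/*ref%3D",
    "/forum/*?forumId*",
    "/forum/*?&controller=embed",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper)."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with ".*" where "*" appeared.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


# The example paths below are made up for illustration.
print(is_disallowed("/forum/search/"))                            # True
print(is_disallowed("/forum/topic/123-title/?do=getNewComment"))  # True
print(is_disallowed("/forum/topic/123-title/"))                   # False
print(is_disallowed("/forum/topic/1/?page=2&sortby=date"))        # False
```

Under these assumptions the last example is not blocked: the pattern `/forum/*?sortby=` requires a literal `?` directly before `sortby=`, so it only matches when that parameter is first in the query string. The sketch also shows that `/forum/discover/` already covers `/forum/discover/unread/` as a prefix.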

Other Records

Field Value
sitemap https://www.agrofoto.pl/forum/sitemap.php

Comments

  • Rules for Invision Community (https://invisioncommunity.com)
  • Block pages with no unique content
  • Disallow: /forum/tags/
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
  • Disallow: /forum/profile/
  • Sitemap URL