main.pl
robots.txt

Robots Exclusion Standard data for main.pl

Resource Scan

Scan Details

Site Domain main.pl
Base Domain main.pl
Scan Status Ok
Last Scan2025-08-18T19:18:35+00:00
Next Scan 2025-09-17T19:18:35+00:00

Last Scan

Scanned2025-08-18T19:18:35+00:00
URL https://main.pl/robots.txt
Domain IPs 104.26.6.14, 104.26.7.14, 172.67.68.152, 2606:4700:20::681a:60e, 2606:4700:20::681a:70e, 2606:4700:20::ac43:4498
Response IP 172.67.68.152
Found Yes
Hash fd41cc1ccdec0c23f896b837759dd234cb5c4e2cbd97a390ea8e464f32067d97
SimHash ed3d550847f2

Groups

*

Rule Path
Disallow /about/
Disallow /adx/
Disallow /captcha/
Disallow /cgi-bin/
Disallow /cookie/
Disallow /embed/
Disallow /feed/
Disallow /folder/
Disallow /include/
Disallow /index.php/
Disallow /login/
Disallow /media/
Disallow /node/
Disallow /page/
Disallow /p-content/
Disallow /portfolio-category/
Disallow /portfolio-item/
Disallow /portfolio-types/
Disallow /project/
Disallow /sliders/
Disallow /trackback/
Disallow /uncategorized/
Disallow /uploads/
Disallow /test/
Disallow /wp-admin/
Disallow /wp-icludes/
Disallow /wp-content/uploads/media-from-ftp-tmp/
Disallow /api.php
Disallow /cron.php
Disallow /get.php
Disallow /cron.sh
Disallow /error_log
Disallow /wp-login.php
Disallow /install.php
Disallow /xmlrpc.php
Disallow /portal.html
Disallow /purposes.json
Disallow /*%26controller%3D*
Disallow /*%26referraluserkey%3D*
Disallow /*?utm_source=*
Disallow /*?fbclid=*
Disallow /*?p=*
Disallow /newsletter-thx/
Disallow /filtry/
Disallow /filtry/*