Robots Exclusion Standard data for goal.pl

Resource Scan

Scan Details

Site Domain goal.pl
Base Domain goal.pl
Scan Status Ok
Last Scan 2024-11-12T13:22:42+00:00
Next Scan 2024-11-19T13:22:42+00:00

Last Scan

Scanned 2024-11-12T13:22:42+00:00
URL https://goal.pl/robots.txt
Redirect https://www.goal.pl/robots.txt
Redirect Domain www.goal.pl
Redirect Base goal.pl
Domain IPs 104.21.28.175, 172.67.146.239, 2606:4700:3030::6815:1caf, 2606:4700:3037::ac43:92ef
Redirect IPs 104.21.28.175, 172.67.146.239, 2606:4700:3030::6815:1caf, 2606:4700:3037::ac43:92ef
Response IP 172.67.146.239
Found Yes
Hash c268eb184ad535b4451e16a879fe51ef9aec0ef7aa469c45682394ad8c5007b5
SimHash 4511a506e292

Groups

*

Rule Path
Disallow /index.php?dzial=archiwum*
Disallow /galeria/
Disallow /luba/*
Disallow /goto/
Disallow /bc-access/*
Disallow /*/feed/
Disallow /*/*/feed/
Disallow /*?keywords=*
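
Several of the rules above rely on the `*` wildcard, which Python's standard `urllib.robotparser` does not expand. The following is a minimal sketch of how such rules could be checked against a URL path, assuming the common interpretation in which `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `rule_to_regex` and `is_disallowed` are hypothetical helpers, not part of any library, and because this group has no Allow rules the sketch does not implement longest-match precedence.

```python
import re

# Disallow rules copied from the "*" group in the scan report.
DISALLOW_RULES = [
    "/index.php?dzial=archiwum*",
    "/galeria/",
    "/luba/*",
    "/goto/",
    "/bc-access/*",
    "/*/feed/",
    "/*/*/feed/",
    "/*?keywords=*",
]


def rule_to_regex(rule):
    """Translate one robots.txt path rule into a compiled regex (hypothetical helper)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


COMPILED_RULES = [rule_to_regex(rule) for rule in DISALLOW_RULES]


def is_disallowed(path):
    """Return True if the path (including any query string) matches a Disallow rule."""
    return any(pattern.match(path) for pattern in COMPILED_RULES)


if __name__ == "__main__":
    for sample in ["/galeria/foto.jpg", "/news/feed/", "/news/", "/szukaj?keywords=liga"]:
        print(sample, is_disallowed(sample))
```

The sample paths in the `__main__` block are made up for illustration; only the rule list comes from the report.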

Other Records

Field Value
sitemap https://www.goal.pl/sitemap_index.xml
sitemap https://www.goal.pl/news-sitemap.xml
sitemap https://www.goal.pl/author-sitemap.xml
sitemap https://www.goal.pl/klub-sitemap.xml
sitemap https://www.goal.pl/pilkarze-sitemap.xml
sitemap https://www.goal.pl/ligi-sitemap.xml
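
The sitemap records above could be read back out of a robots.txt file with Python's standard `urllib.robotparser` (its `site_maps()` method is available from Python 3.8). This is only a sketch: the `ROBOTS_TXT` text below is a partial reconstruction assumed from the report, not the file as actually served, and it includes just two of the Disallow rules for brevity.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file; the real layout may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /galeria/
Disallow: /goto/

# Sitemaps
Sitemap: https://www.goal.pl/sitemap_index.xml
Sitemap: https://www.goal.pl/news-sitemap.xml
Sitemap: https://www.goal.pl/author-sitemap.xml
Sitemap: https://www.goal.pl/klub-sitemap.xml
Sitemap: https://www.goal.pl/pilkarze-sitemap.xml
Sitemap: https://www.goal.pl/ligi-sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap records are present.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```

Parsing from a string keeps the example offline; against the live site one would instead call `set_url()` and `read()` on the parser.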

Comments

  • Sitemaps