internetstandard.pl
robots.txt

Robots Exclusion Standard data for internetstandard.pl

Resource Scan

Scan Details

Site Domain internetstandard.pl
Base Domain internetstandard.pl
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-31T20:39:26+00:00
Next Scan 2024-10-30T20:39:26+00:00

Last Successful Scan

Scanned2024-07-03T20:38:11+00:00
URL https://internetstandard.pl/robots.txt
Redirect https://www.internetstandard.pl/robots.txt
Redirect Domain www.internetstandard.pl
Redirect Base internetstandard.pl
Domain IPs 104.26.12.111, 104.26.13.111, 172.67.68.126, 2606:4700:20::681a:c6f, 2606:4700:20::681a:d6f, 2606:4700:20::ac43:447e
Redirect IPs 104.26.12.111, 104.26.13.111, 172.67.68.126, 2606:4700:20::681a:c6f, 2606:4700:20::681a:d6f, 2606:4700:20::ac43:447e
Response IP 104.26.12.111
Found Yes
Hash bb0f3d52b5940ecdef73846dd3f9e167bf1d8cedb3d89c302e483b635f070d3e
SimHash 691c8425e733

Groups

*

Rule Path
Allow /
Disallow /sonda/sonda_news.asp
Disallow /sonda/ajax/
Disallow /stats/
Disallow /block/
Disallow /ajax/
Disallow /auth/block/
Disallow /comment/block/
Disallow /news/block/
Disallow /news/ajax/
Disallow /poll/block/
Disallow /gallery/block/
Disallow /8456/IDG.PL_E_internetSTANDARD.pl/

Other Records

Field Value
sitemap https://www.internetstandard.pl/sitemap/sitemap_news.xml
sitemap https://www.internetstandard.pl/sitemap/sitemap_whitepaper.xml
sitemap https://www.internetstandard.pl/sitemap.xml