progroshi.news
robots.txt

Robots Exclusion Standard data for progroshi.news

Resource Scan

Scan Details

Site Domain progroshi.news
Base Domain progroshi.news
Scan Status Ok
Last Scan2024-11-13T00:17:26+00:00
Next Scan 2024-11-20T00:17:26+00:00

Last Scan

Scanned2024-11-13T00:17:26+00:00
URL https://progroshi.news/robots.txt
Domain IPs 104.26.4.175, 104.26.5.175, 172.67.70.43, 2606:4700:20::681a:4af, 2606:4700:20::681a:5af, 2606:4700:20::ac43:462b
Response IP 172.67.70.43
Found Yes
Hash 9d22b664f6e4d275bea32a8f5cc7063b668f350bc971ffeeb786b585ec288a16
SimHash 2d0d18428cf1

Groups

*

Rule Path
Disallow /_block/
Disallow /_page/
Disallow /quiz_get/
Disallow /media/get/
Disallow /news/view/inc/
Disallow /ru/news/view/inc/
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://progroshi.news/sitemap/sitemap.xml
sitemap https://progroshi.news/xml/gnews_new.xml
sitemap https://progroshi.news/ru/xml/gnews_new.xml