gdz.by
robots.txt

Robots Exclusion Standard data for gdz.by

Resource Scan

Scan Details

Site Domain gdz.by
Base Domain gdz.by
Scan Status Ok
Last Scan2024-06-03T23:57:22+00:00
Next Scan 2024-07-03T23:57:22+00:00

Last Scan

Scanned2024-06-03T23:57:22+00:00
URL https://gdz.by/robots.txt
Domain IPs 104.26.6.190, 104.26.7.190, 172.67.71.68, 2606:4700:20::681a:6be, 2606:4700:20::681a:7be, 2606:4700:20::ac43:4744
Response IP 172.67.71.68
Found Yes
Hash 0b61127ec0ec28926921b68e982d63628d372fae95325805f706f06191abf90b
SimHash b4989507ab97

Groups

*

Rule Path
Allow /
Allow /*.js
Allow /*.css
Allow /*.webp
Disallow /*?
Allow /*.css?*
Disallow /confirm
Disallow /index/1
Disallow /index/3
Disallow /index/5
Disallow /index/7
Disallow /index/8
Disallow /index/9
Disallow /index/sub/
Disallow /panel/
Disallow /register
Disallow /register2
Disallow /verify
Disallow /stat/
Disallow /admin/
Disallow /informer/
Disallow /secure/
Disallow /poll/
Disallow /search/
Disallow /abnl/
Disallow /*_escaped_fragment_%3D
Disallow /*-*-*-*-987$
Disallow /*0-*-0-17$
Disallow /*-0-0-
Disallow /load/*
Disallow /noindex/*
Disallow */index
Disallow */search
Disallow *redirect*
Disallow /*/*/.html
Disallow /board/*

Other Records

Field Value
sitemap https://gdz.by/sitemap-load.xml