sputnik.by
robots.txt

Robots Exclusion Standard data for sputnik.by

Archived Snapshots

Resource Scan

Scan Details

Site Domain	sputnik.by
Base Domain	sputnik.by
Scan Status	Ok
Last Scan	2024-11-09T03:22:28+00:00
Next Scan	2024-11-16T03:22:28+00:00

Last Scan

Scanned	2024-11-09T03:22:28+00:00
URL	https://sputnik.by/robots.txt
Domain IPs	194.190.139.2
Response IP	194.190.139.2
Found	Yes
Hash	f7b14e7e6b7bf462ac7701379564e81caf4aef38d0bb2e893b0ab1ba22698cc0
SimHash	3c2bad87c713

Groups

*

Rule	Path
Disallow	*-print.html$
Disallow	/sys_*
Disallow	/search/
Disallow	/services/
Disallow	/cms/
Disallow	*/calendar.html
Disallow	/_editorial_preview_*
Disallow	/files/
Disallow	*/more.html
Disallow	/ig/

Rule

Path

Disallow

*-print.html$

Disallow

/sys_*

Disallow

/search/

Disallow

/services/

Disallow

/cms/

Disallow

*/calendar.html

Disallow

/_editorial_preview_*

Disallow

/files/

Disallow

*/more.html

Disallow

/ig/

Back to top

Other Records

Field	Value
sitemap	https://sputnik.by/sitemap_article_index.xml
sitemap	https://sputnik.by/sitemap_list_index.xml
sitemap	https://sputnik.by/sitemap_archive.xml

Field

Value

sitemap

https://sputnik.by/sitemap_article_index.xml

sitemap

https://sputnik.by/sitemap_list_index.xml

sitemap

https://sputnik.by/sitemap_archive.xml

Back to top

Warnings

`clean-param` is not a known field.

Back to top

sputnik.byrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Warnings

sputnik.by
robots.txt