planeta.by
robots.txt

Robots Exclusion Standard data for planeta.by

Resource Scan

Scan Details

Site Domain planeta.by
Base Domain planeta.by
Scan Status Ok
Last Scan 2024-05-27T21:15:51+00:00
Next Scan 2024-06-03T21:15:51+00:00

Last Scan

Scanned 2024-05-27T21:15:51+00:00
URL https://planeta.by/robots.txt
Domain IPs 104.21.0.170, 172.67.128.34, 2606:4700:3031::6815:aa, 2606:4700:3035::ac43:8022
Response IP 104.21.0.170
Found Yes
Hash d490abe918a03f76ba6d8241fef00d7b326ced904a81054f00f7971c4956c3c0
SimHash 0b3558e1a6a2

Groups

*

Rule Path
Allow /js/*
Allow /*.js
Allow /*.css
Allow /css/*
Allow /wp-includes/js/
Allow /wp-includes/css/
Allow /wp-content/cache/
Allow /wp-content/themes/
Allow /wp-content/plugins/
Disallow /tag/
Disallow */feed
Disallow /*E2e%3D
Disallow */?s=*
Disallow */?p=*
Disallow /*type%3D
Disallow /cgi-bin
Disallow /*etext%3D
Disallow /wp-json/
Disallow /wp-admin
Disallow /*action%3D
Disallow /*tpclid%3D
Disallow /trackback
Disallow /*PAGEN_1%3D
Disallow /*hhtmFrom%3D
Disallow */trackback
Disallow /*_x_tr_sl%3D
Disallow /*_ym_debug%3D
Disallow /*kitchen%5B%5D%3D
Disallow */*/trackback
Disallow /*utm_referrer%3D
Disallow /type-specialists
Disallow /*__iloveadaptive-hash__%3D
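How a crawler might evaluate the rules above can be sketched as follows. This is a minimal illustration, not the scanner's or any crawler's actual implementation. It assumes the matching behavior described in RFC 9309: `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The `RULES` list is a hand-picked subset of the table above, and the function names `pattern_to_regex` and `is_allowed` are hypothetical.

```python
import re

# A hand-picked subset of the "*" group rules listed above.
RULES = [
    ("allow", "/*.js"),
    ("allow", "/wp-content/themes/"),
    ("disallow", "/tag/"),
    ("disallow", "*/feed"),
    ("disallow", "/wp-admin"),
    ("disallow", "*/?s=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in ["/about", "/tag/news/", "/blog/feed", "/wp-admin/admin.js"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, `/wp-admin/admin.js` is blocked even though it matches `Allow /*.js`, because `/wp-admin` is the longer pattern. Note that Python's standard `urllib.robotparser` does not interpret `*` wildcards in paths, which is why this sketch does its own matching.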

googlebot-image

Rule Path
Allow /

yandeximages

Rule Path
Allow /
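The three groups listed (`*`, `googlebot-image`, `yandeximages`) are alternatives rather than layers: a crawler normally obeys only the group that names it and falls back to `*` otherwise. A rough sketch of that selection step, assuming case-insensitive exact matching on the crawler's product token (the `select_group` helper and the `GROUPS` mapping are hypothetical names for illustration):

```python
# Group names taken from the scan above; the rule lists are abbreviated.
GROUPS = {
    "*": [("disallow", "/tag/"), ("disallow", "/wp-admin")],
    "googlebot-image": [("allow", "/")],
    "yandeximages": [("allow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Pick the group name a crawler with this product token would follow.

    Assumption: tokens are compared case-insensitively, and a crawler
    with no dedicated group uses the "*" group.
    """
    token = product_token.lower()
    return token if token in groups else "*"


if __name__ == "__main__":
    for bot in ["Googlebot-Image", "YandexImages", "bingbot"]:
        name = select_group(bot)
        print(bot, "->", name, GROUPS[name])
```

Read this way, the image crawlers are given full access with `Allow /`, while every other crawler is subject to the long `*` rule list.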

Other Records

Field Value
sitemap https://planeta.by/sitemap_index.xml