newspl.eu
robots.txt

Robots Exclusion Standard data for newspl.eu

Resource Scan

Scan Details

Site Domain newspl.eu
Base Domain newspl.eu
Scan Status Ok
Last Scan 5/15/2025, 6:27:53 PM
Next Scan 5/22/2025, 6:27:53 PM

Last Scan

Scanned 5/15/2025, 6:27:53 PM
URL https://newspl.eu/robots.txt
Domain IPs 104.21.85.58, 172.67.202.189, 2606:4700:3035::ac43:cabd, 2606:4700:3036::6815:553a
Response IP 104.21.85.58
Found Yes
Hash 7ae273c38f58da31e8494cde6e0fc282c1a6850df5487f5bfb567947752ace1f
SimHash 6856c3925791

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

apis-google

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

adsbot-google-mobile

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

crawl

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

checker

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

python

Rule Path
Disallow /

custo

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

bingpreview

Rule Path
Disallow

gigabot

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

*

Rule Path
Disallow
Disallow /cgi-bin/
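
The groups above differ mainly in whether the Disallow path is empty or "/". Under the Robots Exclusion Protocol, an empty Disallow blocks nothing, while "Disallow /" blocks the whole site. The sketch below illustrates that reading for three of the listed groups. The GROUPS table is a hand-made reconstruction from this report, not the raw robots.txt file, and is_allowed is a hypothetical helper. It assumes exact group-name lookup and plain prefix matching, and it ignores Allow rules and wildcards, so it should not be taken as a full parser.

```python
# Hypothetical reconstruction of three groups from the scan above.
# Each value lists that group's Disallow paths; "" is an empty Disallow.
GROUPS = {
    "googlebot": [""],
    "slurp": ["/"],
    "*": ["", "/cgi-bin/"],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` is not blocked for `user_agent`.

    Simplified: case-insensitive exact group lookup with "*" as fallback,
    plain prefix matching, no Allow rules or wildcards.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # An empty Disallow value matches nothing, so it is skipped.
    return not any(rule and path.startswith(rule) for rule in rules)


if __name__ == "__main__":
    checks = [
        ("googlebot", "/news/"),
        ("slurp", "/news/"),
        ("examplebot", "/cgi-bin/test"),  # "examplebot" is a made-up agent
        ("examplebot", "/news/"),
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(agent, path))
```

With this reading, googlebot may fetch any path, slurp may fetch none, and an agent with no group of its own falls back to the "*" group, where only /cgi-bin/ is blocked.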

Other Records

Field Value
sitemap https://novosti.sprosi.eu/sitemap_index.xml