stiripesurse.ro
robots.txt

Robots Exclusion Standard data for stiripesurse.ro

Resource Scan

Scan Details

Site Domain stiripesurse.ro
Base Domain stiripesurse.ro
Scan Status Ok
Last Scan 2024-06-10T21:06:17+00:00
Next Scan 2024-06-17T21:06:17+00:00

Last Scan

Scanned 2024-06-10T21:06:17+00:00
URL https://stiripesurse.ro/robots.txt
Redirect https://www.stiripesurse.ro/robots.txt
Redirect Domain www.stiripesurse.ro
Redirect Base stiripesurse.ro
Domain IPs 104.26.12.9, 104.26.13.9, 172.67.68.99, 2606:4700:20::681a:c09, 2606:4700:20::681a:d09, 2606:4700:20::ac43:4463
Redirect IPs 104.26.12.9, 104.26.13.9, 172.67.68.99, 2606:4700:20::681a:c09, 2606:4700:20::681a:d09, 2606:4700:20::ac43:4463
Response IP 104.26.12.9
Found Yes
Hash 4d64745cd51dc8cffdf0a4f06b989577ea430d05a8cc4490452f12483044d5ac
SimHash 60026be9a6b4

Groups

*

Rule Path
Disallow /print_
Disallow /securimage_show.php
Disallow /alert.html
Disallow /membri/
Disallow /cauta?s=

googlebot-image

Rule Path
Allow /

cxensebot

Rule Path
Disallow (empty value)

gptbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

cxensebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /
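
The groups above can be checked programmatically. The following is a minimal sketch, assuming Python's standard urllib.robotparser module and a hand-written excerpt of the file reconstructed from the listing (the ROBOTS_TXT constant, the build_parser helper, the "SomeBot" agent name, and the sample URL paths are illustrative and not part of the scan data):

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed by hand from the groups listed above (assumption:
# the live file uses the usual "User-agent:" / "Disallow:" / "Allow:" syntax).
ROBOTS_TXT = """\
User-agent: *
Disallow: /print_
Disallow: /securimage_show.php
Disallow: /alert.html
Disallow: /membri/

User-agent: Googlebot-Image
Allow: /

User-agent: GPTBot
Disallow: /
"""


def build_parser(text):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.stiripesurse.ro"

# GPTBot has its own group that disallows everything.
gptbot_home = parser.can_fetch("GPTBot", base + "/")

# A crawler without its own group (hypothetical name) falls back to "*".
somebot_members = parser.can_fetch("SomeBot", base + "/membri/profil")
somebot_article = parser.can_fetch("SomeBot", base + "/politica/stire")

# Googlebot-Image has its own group, so the "*" rules do not apply to it.
image_members = parser.can_fetch("Googlebot-Image", base + "/membri/profil")

print(gptbot_home, somebot_members, somebot_article, image_members)
```

Note that urllib.robotparser applies the first group whose name matches the crawler and uses simple prefix matching on paths, so its answers may differ from those of a crawler that merges groups or uses longest-match rules.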

Other Records

Field Value
sitemap https://www.stiripesurse.ro/sitemap/sitemap-index.xml
sitemap https://www.stiripesurse.ro/sitemap-google-news/sitemap_google_news.xml
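
The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available); the SITEMAP_RECORDS constant and the list_sitemaps helper are illustrative names, and the "Sitemap:" line syntax is assumed from the usual robots.txt convention:

```python
from urllib.robotparser import RobotFileParser

# The two sitemap records from the listing above, written in robots.txt syntax.
SITEMAP_RECORDS = """\
Sitemap: https://www.stiripesurse.ro/sitemap/sitemap-index.xml
Sitemap: https://www.stiripesurse.ro/sitemap-google-news/sitemap_google_news.xml
"""


def list_sitemaps(robots_text):
    """Return the sitemap URLs declared in robots.txt text (empty list if none)."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


sitemaps = list_sitemaps(SITEMAP_RECORDS)
for url in sitemaps:
    print(url)
```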