nhacaivua.com
robots.txt

Robots Exclusion Standard data for nhacaivua.com

Resource Scan

Scan Details

Site Domain nhacaivua.com
Base Domain nhacaivua.com
Scan Status Ok
Last Scan2024-10-15T14:29:22+00:00
Next Scan 2024-11-14T14:29:22+00:00

Last Scan

Scanned2024-10-15T14:29:22+00:00
URL https://nhacaivua.com/robots.txt
Redirect http://nhacaivua.com/robots.txt
Domain IPs 104.21.70.63, 172.67.220.171, 2606:4700:3033::6815:463f, 2606:4700:3035::ac43:dcab
Response IP 172.67.220.171
Found Yes
Hash 644dbf9f44e314373baa59ba4ba8fba19231a3675aa7e67729099183b0758bb8
SimHash 103cf1c82d1a

Groups

baiduspider

Rule Path
Disallow /

yandexvideoparser

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

proximic

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

Warnings

  • 2 invalid lines.