hsctvn.com
robots.txt

Robots Exclusion Standard data for hsctvn.com

Resource Scan

Scan Details

Site Domain hsctvn.com
Base Domain hsctvn.com
Scan Status Ok
Last Scan 5/18/2025, 5:28:57 AM
Next Scan 5/25/2025, 5:28:57 AM

Last Scan

Scanned 5/18/2025, 5:28:57 AM
URL https://hsctvn.com/robots.txt
Domain IPs 104.21.67.52, 172.67.213.211, 2606:4700:3035::ac43:d5d3, 2606:4700:3037::6815:4334
Response IP 172.67.213.211
Found Yes
Hash 4f803a6798371423455b6c8f46d6361ee764eed71b48da3a059e852a7b86b346
SimHash 505d8370e690

Groups

*

Rule Path
Allow /

seekportbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

criteobot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

applebot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

ias-ie

Rule Path
Disallow /

Other Records

Field Value
sitemap https://hsctvn.com/sitemap/sitemap.xml
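
The groups and sitemap record above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser (site_maps() needs Python 3.8 or later). The report does not include the raw file body, so the ROBOTS_TXT string is a hypothetical reconstruction of only a few of the listed groups, and the helper name is_allowed is an illustrative assumption. Nothing is fetched over the network.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a subset of the groups listed in the report.
# The real robots.txt body is not part of the scan data shown here.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: gptbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: scrapy
Disallow: /

Sitemap: https://hsctvn.com/sitemap/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no HTTP request is made.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, url: str = "https://hsctvn.com/") -> bool:
    """Return True if the reconstructed rules let user_agent fetch url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Agents with their own group are checked first; anything else
    # falls back to the "*" group, which allows everything.
    for agent in ("Googlebot", "gptbot", "ahrefsbot", "scrapy"):
        print(agent, is_allowed(agent))
    print(parser.site_maps())
```

Under these assumed rules, an agent with its own group is blocked from every path, while an agent with no matching group falls through to the `*` group and is allowed. Python's parser matches user-agent names case-insensitively, so `GPTBot` and `gptbot` behave the same.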