tinhhoa.net
robots.txt

Robots Exclusion Standard data for tinhhoa.net

Resource Scan

Scan Details

Site Domain tinhhoa.net
Base Domain tinhhoa.net
Scan Status Ok
Last Scan2024-10-05T18:06:42+00:00
Next Scan 2024-10-12T18:06:42+00:00

Last Scan

Scanned2024-10-05T18:06:42+00:00
URL https://tinhhoa.net/robots.txt
Domain IPs 104.21.235.205, 104.21.235.206, 2606:4700:3038::6815:ebcd, 2606:4700:3038::6815:ebce
Response IP 104.21.235.206
Found Yes
Hash e0ffb105f2906c8e5b631f7de7259fb8594517a604679d1204accca47565d465
SimHash 91059b448f37

Groups

*

Rule Path
Allow /
Allow /ads.txt
Disallow /wp-admin/*
Disallow /wp-includes/*
Disallow /search?q=*
Disallow /images/
Disallow /counter/*
Disallow /cronjob/*
Disallow /data/*
Disallow /ho-so/*
Disallow /author/*
Disallow /tag/*

Other Records

Field Value
sitemap https://tinhhoa.net/sitemap_index.xml