tinhhoa.net
robots.txt
Robots Exclusion Standard data for tinhhoa.net
Resource Scan
Scan Details
Site Domain | tinhhoa.net |
Base Domain | tinhhoa.net |
Scan Status | Ok |
Last Scan | 2024-10-05T18:06:42+00:00 |
Next Scan | 2024-10-12T18:06:42+00:00 |
Last Scan
Scanned | 2024-10-05T18:06:42+00:00 |
URL | https://tinhhoa.net/robots.txt |
Domain IPs | 104.21.235.205, 104.21.235.206, 2606:4700:3038::6815:ebcd, 2606:4700:3038::6815:ebce |
Response IP | 104.21.235.206 |
Found | Yes |
Hash | e0ffb105f2906c8e5b631f7de7259fb8594517a604679d1204accca47565d465 |
SimHash | 91059b448f37 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Allow | /ads.txt |
Disallow | /wp-admin/* |
Disallow | /wp-includes/* |
Disallow | /search?q=* |
Disallow | /images/ |
Disallow | /counter/* |
Disallow | /cronjob/* |
Disallow | /data/* |
Disallow | /ho-so/* |
Disallow | /author/* |
Disallow | /tag/* |
Other Records
Field | Value |
---|---|
sitemap | https://tinhhoa.net/sitemap_index.xml |