giathuocbaonhieu.com
robots.txt

Robots Exclusion Standard data for giathuocbaonhieu.com

Resource Scan

Scan Details

Site Domain giathuocbaonhieu.com
Base Domain giathuocbaonhieu.com
Scan Status Ok
Last Scan2024-11-16T04:22:26+00:00
Next Scan 2024-11-23T04:22:26+00:00

Last Scan

Scanned2024-11-16T04:22:26+00:00
URL https://giathuocbaonhieu.com/robots.txt
Domain IPs 104.21.64.182, 172.67.154.76, 2606:4700:3030::6815:40b6, 2606:4700:3031::ac43:9a4c
Response IP 172.67.154.76
Found Yes
Hash 9e3be2dbdcea9f76ce5ec3bac918f0951ee3595e9ef5730a81b2ebcb31a74529
SimHash d60b7c06ced3

Groups

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /
Disallow /trackback/
Disallow /feed/
Disallow */trackback/
Disallow */feed/
Disallow /?*
Disallow /*?

Other Records

Field Value
sitemap https://giathuocbaonhieu.com/post-sitemap.xml
sitemap https://giathuocbaonhieu.com/sitemap_index.xml