giaanproperty.vn
robots.txt

Robots Exclusion Standard data for giaanproperty.vn

Resource Scan

Scan Details

Site Domain giaanproperty.vn
Base Domain giaanproperty.vn
Scan Status Ok
Last Scan 2024-11-14T16:04:16+00:00
Next Scan 2024-11-21T16:04:16+00:00

Last Scan

Scanned 2024-11-14T16:04:16+00:00
URL https://giaanproperty.vn/robots.txt
Domain IPs 103.221.223.45
Response IP 103.221.223.45
Found Yes
Hash ad67d48fbe30032ece22ab96b04772ebedbdee773d3c2b563926a782f9f1b92c
SimHash 530dd8524711

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /wp-admin/images/*
Disallow /wp-includes/
Allow /wp-includes/js
Allow /wp-includes/css
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /feed/
Disallow /*/feed/*
Disallow /*/*?s=*
Disallow /*/*.inc$
Disallow /transfer/
Disallow /refer/
Disallow /*/cgi-bin/*
Disallow /*/blackhole/*
Disallow /*/trackback/*
Disallow /*/xmlrpc.php
Disallow /suggest/?*
Disallow /readme.html
Disallow /*?hpp_next=*
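
The Allow and Disallow rules in the `*` group overlap (for example, `/wp-admin/` is disallowed while `/wp-admin/admin-ajax.php` is allowed), so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch of the longest-match evaluation described in RFC 9309, assuming `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, only a subset of the rules listed above is included, and individual crawlers may evaluate rules differently.

```python
import re

# Subset of the "*" group rules from the listing above (illustrative only).
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-admin/images/*"),
    ("disallow", "/wp-includes/"),
    ("allow", "/wp-includes/js"),
    ("disallow", "/*/feed/*"),
    ("disallow", "/*/*.inc$"),
    ("disallow", "/*?hpp_next=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where the pattern had '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match evaluation."""
    best_len = -1      # length of the most specific matching pattern so far
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow and not best_allow
            if longer or tie_prefers_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/wp-admin/options.php", "/wp-admin/admin-ajax.php", "/blog/feed/"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions the longer Allow pattern wins over the shorter Disallow for `/wp-admin/admin-ajax.php`, while other paths under `/wp-admin/` stay blocked.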

ahrefsbot
baiduspider
easouspider
ezooms
yandexbot
mj12bot
sitesucker
httrack
httrack website copier
teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
webcopier
offline explorer pro
offline commander
leech
websnake
blackwidow
http weazel

Rule Path
Disallow /

nutch

Rule Path
Disallow /

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinerest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
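
The crawl-delay records for the `mj12bot` and `pinerest` groups can be read with Python's standard `urllib.robotparser` module. This is a minimal sketch: the `ROBOTS_LINES` text is a hand-written reconstruction of just these two groups, not the actual file, and it leaves out the separate Disallow group that also names `mj12bot`. Crawl-delay is a non-standard field, so whether a given crawler honours it is an assumption.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the two crawl-delay groups shown above.
ROBOTS_LINES = """\
User-agent: mj12bot
Crawl-delay: 10

User-agent: pinerest
Crawl-delay: 1
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in-memory lines instead of fetching a URL

# crawl_delay() returns the delay in seconds, or None if no group applies.
mj12_delay = parser.crawl_delay("mj12bot")
pinerest_delay = parser.crawl_delay("pinerest")
other_delay = parser.crawl_delay("otherbot")

if __name__ == "__main__":
    print(mj12_delay, pinerest_delay, other_delay)
```

Note that `urllib.robotparser` does not interpret `*` or `$` inside paths, so it is only suitable here for reading fields such as crawl-delay, not for evaluating the wildcard rules in the `*` group.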

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.giaanproperty.vn/sitemap_index.xml
sitemap https://www.giaanproperty.vn/post-sitemap.xml
sitemap https://www.giaanproperty.vn/page-sitemap.xml
sitemap https://www.giaanproperty.vn/product-sitemap.xml
sitemap https://www.giaanproperty.vn/category-sitemap.xml
sitemap https://www.giaanproperty.vn/post_tag-sitemap.xml
sitemap https://www.giaanproperty.vn/product_cat-sitemap.xml

Comments

  • Disallow: /wp-content/plugins/
  • Disallow: /wp-content/themes/
  • protect my site from HTTrack or other software's ripping?
  • https://www.giaanproperty.vn/sitemap.xml