spacebox.com.hk
robots.txt

Robots Exclusion Standard data for spacebox.com.hk

Resource Scan

Scan Details

Site Domain spacebox.com.hk
Base Domain spacebox.com.hk
Scan Status Ok
Last Scan2024-09-18T15:28:49+00:00
Next Scan 2024-10-18T15:28:49+00:00

Last Scan

Scanned2024-09-18T15:28:49+00:00
URL https://spacebox.com.hk/robots.txt
Redirect https://www.spacebox.com.hk/robots.txt
Redirect Domain www.spacebox.com.hk
Redirect Base spacebox.com.hk
Domain IPs 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52
Redirect IPs 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52
Response IP 199.232.47.52
Found Yes
Hash 8aaa1b947c765866339a5a77fcd3e01179d49e69ad2ce6395570597292c7a7cb
SimHash aa042c0d08f0

Groups

*

Rule Path
Disallow /dora/result-help
Disallow /dora/result-success
Disallow /dora/result-box
Disallow /dora/result-help/
Disallow /dora/result-success/
Disallow /dora/result-box/
Disallow /dora/result-help?
Disallow /dora/result-success?
Disallow /dora/result-box?
Disallow /dora/
Disallow /security
Disallow /better-quote
Disallow /storage/chaiwan-storage/
Disallow /wp-admin/
Disallow /plugins/
Disallow /css/
Disallow /faqs/*
Disallow /zh-TW/faqs/*
Disallow /themes/
Disallow /assets/
Disallow /*?*
Disallow /tag/

Other Records

Field Value
sitemap https://www.spacebox.com.hk/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: