qvapehouse.com
robots.txt

Robots Exclusion Standard data for qvapehouse.com

Resource Scan

Scan Details

Site Domain qvapehouse.com
Base Domain qvapehouse.com
Scan Status Ok
Last Scan 2024-09-08T09:30:04+00:00
Next Scan 2024-10-08T09:30:04+00:00

Last Scan

Scanned 2024-09-08T09:30:04+00:00
URL https://qvapehouse.com/robots.txt
Domain IPs 104.26.8.218, 104.26.9.218, 172.67.69.135, 2606:4700:20::681a:8da, 2606:4700:20::681a:9da, 2606:4700:20::ac43:4587
Response IP 104.26.8.218
Found Yes
Hash 3336406da027c1bd745cb2462b36e43356f698a4d0b9aed804cd3f379b8b11b7
SimHash 6c4dc4309db0

Groups

*

Rule Path
Disallow /*wp-json/

*

Rule Path
Disallow /*?s=*
Disallow /*search/*

*

Rule Path
Disallow /*?wp-ajax=

*

Rule Path
Disallow /*cdn-cgi/bm/cv/
Disallow /*cdn-cgi/challenge-platform/
Disallow /*cdn-fpw/sxg/
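
The wildcard rules in the `*` groups above can be checked against a URL path with a small matcher. The following is a minimal sketch, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the URL, and a rule otherwise matches as a prefix. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sample paths are made up for illustration; real crawlers may differ in details such as percent-encoding and Allow/Disallow precedence.

```python
import re

# Disallow rules copied from the "*" groups listed above.
WILDCARD_RULES = [
    "/*wp-json/",
    "/*?s=*",
    "/*search/*",
    "/*?wp-ajax=",
    "/*cdn-cgi/bm/cv/",
    "/*cdn-cgi/challenge-platform/",
    "/*cdn-fpw/sxg/",
]


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the URL; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, rules=WILDCARD_RULES):
    """Return True if any rule matches the path (including query string)."""
    # re.match anchors at the start, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    # Hypothetical sample paths, not taken from the scan.
    for sample in ["/wp-json/wp/v2/posts", "/?s=pod", "/de/search/pods", "/product/example/"]:
        print(sample, is_disallowed(sample))
```

Because every rule starts with `/*`, each one effectively blocks the named segment at any depth, for example under the `/de/` or `/eu/` language prefixes as well as at the site root.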

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
mj12bot
psbot
curious george
turnitinbot
npbot-1/2.0
npbot

Rule Path
Disallow /*
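
The group above blocks the listed crawlers from the whole site (`Disallow /*`). As a rough sketch of how a client could decide whether it falls into that group, the code below does a case-insensitive substring comparison of each listed token against a User-Agent string. This matching strategy is an assumption made for illustration: actual crawlers usually compare only their own product token, and entries that carry a version or a space (such as `dotbot/1.0` or `curious george`) may not match in a strict parser. The function name `in_blocked_group` and the sample User-Agent strings are hypothetical.

```python
# User-agent tokens copied from the group listed above.
BLOCKED_AGENTS = [
    "nuclei", "wikido", "riddler", "petalbot", "zoominfobot",
    "go-http-client", "node/simplecrawler", "cazoodlebot", "dotbot/1.0",
    "gigabot", "barkrowler", "blexbot", "magpie-crawler", "mj12bot",
    "psbot", "curious george", "turnitinbot", "npbot-1/2.0", "npbot",
]


def in_blocked_group(user_agent, blocked=BLOCKED_AGENTS):
    """Return True if any listed token occurs in the User-Agent string.

    Assumption: case-insensitive substring matching, which is looser
    than the product-token comparison most crawlers perform.
    """
    ua = user_agent.lower()
    return any(token in ua for token in blocked)


if __name__ == "__main__":
    # Hypothetical User-Agent strings for illustration.
    print(in_blocked_group("Mozilla/5.0 (compatible; MJ12bot/v1.4.8)"))
    print(in_blocked_group("Mozilla/5.0 (compatible; ExampleBot/1.0)"))
```

A client that is not in this group would fall back to the `*` groups, which only restrict specific paths.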

Other Records

Field Value
sitemap https://www.qvapehouse.com/sitemap_index.xml
sitemap https://www.qvapehouse.com/eu/sitemap_index.xml
sitemap https://www.qvapehouse.com/si/sitemap_index.xml
sitemap https://www.qvapehouse.com/de/sitemap_index.xml

Comments

  • Global rules
  • Internal search
  • Leaky plugins
  • Leaky Cloudflare endpoints
  • Sitemaps
  • Noisy bots