valwaldeck.com
robots.txt

Robots Exclusion Standard data for valwaldeck.com

Resource Scan

Scan Details

Site Domain valwaldeck.com
Base Domain valwaldeck.com
Scan Status Ok
Last Scan2025-10-22T17:53:07+00:00
Next Scan 2025-10-29T17:53:07+00:00

Last Scan

Scanned2025-10-22T17:53:07+00:00
URL https://valwaldeck.com/robots.txt
Domain IPs 185.17.24.91
Response IP 185.17.24.91
Found Yes
Hash 0944c632d36ccbfba367e68b821c90726fbc029d23472f245587029f1b7bbda1
SimHash a1454dc376d1

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /bebook/
Disallow /CCBot/
Disallow /cgi-bin/
Disallow /CTH/
Disallow /CTHsong/
Disallow /danpoyntercd_files/
Disallow /Deirdre/
Disallow /dlg/
Disallow /ebooks/
Disallow /formbuilder/
Disallow /images/
Disallow /library/
Disallow /links/
Disallow /MyDD/
Disallow /recommends/
Disallow /seminars/
Disallow /styles/
Disallow /WPVAL/

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/2426028.xml.gz
sitemap http://cdn.attracta.com/sitemap/2426016.xml.gz

Comments

  • robots.txt for http://www.valwaldeck.com/
  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove
  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove