handelszeitung.ch
robots.txt

Robots Exclusion Standard data for handelszeitung.ch

Resource Scan

Scan Details

Site Domain handelszeitung.ch
Base Domain handelszeitung.ch
Scan Status Ok
Last Scan 2024-11-15T11:03:57+00:00
Next Scan 2024-11-22T11:03:57+00:00

Last Scan

Scanned 2024-11-15T11:03:57+00:00
URL https://handelszeitung.ch/robots.txt
Redirect https://www.handelszeitung.ch/robots.txt
Redirect Domain www.handelszeitung.ch
Redirect Base handelszeitung.ch
Domain IPs 104.21.40.154, 172.67.154.100, 2606:4700:3035::ac43:9a64, 2606:4700:3037::6815:289a
Redirect IPs 23.209.46.143, 23.209.46.162
Response IP 96.17.180.50
Found Yes
Hash 79a9312a73f7d9d61a92f9e1fd739af00bd7ed0f25f7357690de5ba83e1c9685
SimHash bc96911a0175

Groups

yandex

Rule Path
Disallow /

seznambot

Rule Path
Disallow /
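The two groups above block the named crawlers from the whole site. A minimal sketch of how a client might check this, using Python's standard urllib.robotparser. The robots.txt text here is a reconstruction from this scan (the two named groups plus a single rule from the catch-all group), not the verbatim file, and the helper name allowed is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan data; NOT the verbatim robots.txt.
ROBOTS_TXT = """\
User-agent: yandex
Disallow: /

User-agent: seznambot
Disallow: /

User-agent: *
Disallow: /suche/
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the scanned host?"""
    return parser.can_fetch(agent, "https://www.handelszeitung.ch" + path)


if __name__ == "__main__":
    for agent, path in [
        ("YandexBot", "/"),
        ("SeznamBot", "/unternehmen"),
        ("Googlebot", "/suche/zins"),
        ("Googlebot", "/unternehmen"),
    ]:
        print(agent, path, allowed(agent, path))
```

Note that urllib.robotparser treats rule paths as literal prefixes, so it is suited to the plain rules shown here but not to the wildcard rules in the catch-all group.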

*

Rule Path
Disallow /veranstaltungskalender
Disallow /suche/
Disallow /authorize/
Disallow /logout/
Disallow /profile/
Disallow *?*email_address=
Disallow *?*form_build_id=
Disallow *?*form_id=
Disallow *?*search_form_block=
Disallow *?*view_dom_id=
Disallow *?*pkBerichtNr=
Disallow *?*Version=*
Disallow *?*Path=*
Disallow *?*_ptid=*
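
The catch-all group mixes plain path prefixes with wildcard patterns that target query parameters. The sketch below shows one way such patterns could be evaluated, assuming the matching semantics of RFC 9309 ("*" matches any character sequence, a trailing "$" anchors the end, everything else is a literal prefix match). The function names rule_to_regex and is_disallowed are hypothetical. Because this group contains no Allow rules, the longest-match precedence between Allow and Disallow is left out.

```python
import re

# Disallow patterns copied from the "*" group of the scan above.
DISALLOW = [
    "/veranstaltungskalender",
    "/suche/",
    "/authorize/",
    "/logout/",
    "/profile/",
    "*?*email_address=",
    "*?*form_build_id=",
    "*?*form_id=",
    "*?*search_form_block=",
    "*?*view_dom_id=",
    "*?*pkBerichtNr=",
    "*?*Version=*",
    "*?*Path=*",
    "*?*_ptid=*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a start-anchored regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then rejoin the pieces with '.*' for each '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


COMPILED = [rule_to_regex(rule) for rule in DISALLOW]


def is_disallowed(path_and_query: str) -> bool:
    """True if any Disallow pattern matches the path plus query string."""
    return any(regex.match(path_and_query) for regex in COMPILED)


if __name__ == "__main__":
    for target in [
        "/suche/?q=zins",
        "/artikel?foo=1&form_id=x",
        "/veranstaltungskalender-archiv",
        "/unternehmen/artikel",
    ]:
        print(target, is_disallowed(target))
```

Under these assumptions the first three targets are disallowed and the last is not. The third case illustrates that /veranstaltungskalender has no trailing slash, so it also matches longer paths that merely start with that string.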

Other Records

Field Value
sitemap https://www.handelszeitung.ch/googlenews.xml
sitemap https://www.handelszeitung.ch/sitemap.xml
sitemap https://www.handelszeitung.ch/image-sitemap.xml
sitemap https://www.handelszeitung.ch/bilanz/sitemap.xml
sitemap https://www.handelszeitung.ch/bilanz/image-sitemap.xml
sitemap https://www.handelszeitung.ch/insurance/sitemap.xml
sitemap https://www.handelszeitung.ch/sitemap-authors.xml
sitemap https://www.handelszeitung.ch/bilanz/sitemap-authors.xml
sitemap https://www.handelszeitung.ch/insurance/sitemap-authors.xml
sitemap https://www.handelszeitung.ch/banking/sitemap-authors.xml
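
The sitemap records above can be read from a robots.txt file programmatically. A minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and using a shortened, reconstructed robots.txt rather than the verbatim file. The grouping by first path segment and the name by_section are illustrative assumptions, not part of the scan.

```python
from collections import defaultdict
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Shortened reconstruction using three of the sitemap URLs listed above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /suche/

Sitemap: https://www.handelszeitung.ch/googlenews.xml
Sitemap: https://www.handelszeitung.ch/sitemap.xml
Sitemap: https://www.handelszeitung.ch/bilanz/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap lines are present.
sitemaps = parser.site_maps() or []

# Group sitemap URLs by their first path segment (e.g. "bilanz").
by_section = defaultdict(list)
for url in sitemaps:
    parts = urlsplit(url).path.split("/")
    section = parts[1] if len(parts) > 2 else "(root)"
    by_section[section].append(url)

if __name__ == "__main__":
    for section, urls in by_section.items():
        print(section, urls)
```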

Comments

  • robots.txt Handelszeitung August 2019
  • handelszeitung.ch
  • This file is to prevent the crawling and indexing of certain parts of your site by web crawlers and spiders run by sites like Yahoo! and Google. By telling these "robots" where not to go on your site, you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/robotstxt.html
  • Bot control
  • Directories
  • Sitemaps
  • Parameters