aatis.de
robots.txt

Robots Exclusion Standard data for aatis.de

Resource Scan

Scan Details

Site Domain aatis.de
Base Domain aatis.de
Scan Status Ok
Last Scan2025-10-04T10:43:15+00:00
Next Scan 2025-11-03T10:43:15+00:00

Last Scan

Scanned2025-10-04T10:43:15+00:00
URL https://aatis.de/robots.txt
Redirect https://www.aatis.de/content/robots.txt
Redirect Domain www.aatis.de
Redirect Base aatis.de
Domain IPs 85.13.153.41
Redirect IPs 85.13.153.41
Response IP 85.13.153.41
Found Yes
Hash 43a1d139dd550886ffc1f3959489e73eecf235717940c89fecd32158ef2882d7
SimHash 2e1c9d094564

Groups

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

yandex

Rule Path
Disallow /

botonparade

Rule Path
Disallow /

amibot

Rule Path
Disallow /

gonzo*

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

gingercrawler

Rule Path
Disallow /

iccrawler - icjobs

Rule Path
Disallow /

slurp

Rule Path
Disallow /

googlebot-image

Rule Path
Allow /content/

googlebot-mobile

Rule Path
Allow /content/

yahoo-mmcrawler

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

asterias

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

urlspion

Rule Path
Disallow /

googlebot

Rule Path
Allow /content/

rbot

Rule Path
Disallow /

*

Rule Path
Disallow /BAUSATZ
Allow /content/

Other Records

Field Value
crawl-delay 120

metajobbot

Rule Path
Disallow /
Disallow /content/sites/default/files/css
Disallow /content/sites/default/files/color
Disallow /content/sites/default/files/js
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /CHANGELOG.txt

*

Rule Path
Disallow /AAAevil.php
Allow /AAAnice.php

cliqzbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://aatis.de/content/sitemap.xml

Comments

  • $Id: robots.txt,v 1.9.2.1 2008/12/10 20:12:19 goba Exp $
  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • For syntax checking, see:
  • http://www.frobee.com/robots-txt-check
  • Directories
  • Files