jobs-in-berlin.info
robots.txt

Robots Exclusion Standard data for jobs-in-berlin.info

Resource Scan

Scan Details

Site Domain	jobs-in-berlin.info
Base Domain	jobs-in-berlin.info
Scan Status	Ok
Last Scan	2024-10-07T13:37:53+00:00
Next Scan	2024-11-06T13:37:53+00:00

Last Scan

Scanned	2024-10-07T13:37:53+00:00
URL	https://jobs-in-berlin.info/robots.txt
Redirect	https://www.jobs-in-berlin.info/robots.txt
Redirect Domain	www.jobs-in-berlin.info
Redirect Base	jobs-in-berlin.info
Domain IPs	157.90.49.3
Redirect IPs	157.90.49.3
Response IP	157.90.49.3
Found	Yes
Hash	9cea5c274b9375216e5b1d48a611c530697d33660c451ed6b4fab53b2fc3156e
SimHash	5bf4f271c175

Groups

*

Rule	Path
Disallow	/impressum.html
Disallow	/datenschutz.html
Disallow	/agb.pdf
Disallow	/agb.html
Disallow	/erweiterte-suche.html
Disallow	/suche.html
Disallow	/job.php
Disallow	/job.php*
Disallow	/unternehmen/
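
The rules in the * group can be checked programmatically. The sketch below rebuilds that group as robots.txt text (a reconstruction from the table above, not the original file) and evaluates a few URLs with Python's standard urllib.robotparser module. The sample paths and the helper name is_allowed are hypothetical. This parser does plain prefix matching and does not expand * inside paths, so under it /job.php* adds nothing beyond /job.php.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in the scan report; not the original file.
STAR_GROUP = """\
User-agent: *
Disallow: /impressum.html
Disallow: /datenschutz.html
Disallow: /agb.pdf
Disallow: /agb.html
Disallow: /erweiterte-suche.html
Disallow: /suche.html
Disallow: /job.php
Disallow: /job.php*
Disallow: /unternehmen/
"""

BASE = "https://www.jobs-in-berlin.info"


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    parser = RobotFileParser()
    parser.parse(STAR_GROUP.splitlines())
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the rules.
    for path in ["/", "/suche.html", "/job.php?id=1", "/unternehmen/example"]:
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

Other crawlers may interpret * as a wildcard; the result for these paths should be the same, since /job.php already matches every URL that /job.php* would.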

msnbot

Rule	Path
Disallow	/impressum.html
Disallow	/datenschutz.html
Disallow	/agb.pdf
Disallow	/agb.html
Disallow	/erweiterte-suche.html
Disallow	/suche.html
Disallow	/job.php
Disallow	/job.php*
Disallow	/unternehmen/

Other Records

Field	Value
crawl-delay	1

bingbot

Rule	Path
Disallow	/impressum.html
Disallow	/datenschutz.html
Disallow	/agb.pdf
Disallow	/agb.html
Disallow	/erweiterte-suche.html
Disallow	/suche.html
Disallow	/job.php
Disallow	/job.php*
Disallow	/unternehmen/

Other Records

Field	Value
crawl-delay	1
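
The crawl-delay record listed for msnbot and bingbot can be read with the same standard-library parser. The following is a minimal sketch, assuming the value is meant as a delay in seconds between requests; the robots.txt text is a reconstruction from this report (only one path rule kept per group for brevity), and delay_for is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the msnbot and bingbot groups in the scan report
# (only one path rule kept per group); not the original file.
BING_GROUPS = """\
User-agent: msnbot
Disallow: /suche.html
Crawl-delay: 1

User-agent: bingbot
Disallow: /suche.html
Crawl-delay: 1
"""


def delay_for(agent: str, default: float = 0.0) -> float:
    """Return the crawl-delay for `agent`, or `default` if none is set."""
    parser = RobotFileParser()
    parser.parse(BING_GROUPS.splitlines())
    delay = parser.crawl_delay(agent)
    return default if delay is None else float(delay)


if __name__ == "__main__":
    for agent in ["msnbot", "bingbot", "googlebot"]:
        print(agent, delay_for(agent))
```

A crawler honoring this value would wait that long between successive requests; agents without a matching group fall back to the default passed in.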

ltx71

Rule	Path
Disallow	/

panscient.com

Rule	Path
Disallow	/

seekport

Rule	Path
Disallow	/

seekport crawler

Rule	Path
Disallow	/

netestate ne crawler

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

lcc

Rule	Path
Disallow	/

seokicks

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/
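
The groups from ltx71 through blexbot each carry a single Disallow of /, which excludes those agents from the whole site. As a rough sketch, the snippet below rebuilds those groups (a reconstruction from this report, with the * group left out) and confirms the effect with urllib.robotparser; build_rules and blocked_entirely are hypothetical helper names.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens listed with "Disallow: /" in the scan report.
BLOCKED_AGENTS = [
    "ltx71", "panscient.com", "seekport", "seekport crawler",
    "netestate ne crawler", "amazonbot", "lcc", "seokicks", "blexbot",
]


def build_rules(agents: list[str]) -> str:
    """Render one 'Disallow: /' group per agent, separated by blank lines."""
    return "\n".join(f"User-agent: {a}\nDisallow: /\n" for a in agents)


def blocked_entirely(agent: str) -> bool:
    """Return True if `agent` may not fetch the site root."""
    parser = RobotFileParser()
    parser.parse(build_rules(BLOCKED_AGENTS).splitlines())
    return not parser.can_fetch(agent, "https://www.jobs-in-berlin.info/")


if __name__ == "__main__":
    for agent in BLOCKED_AGENTS + ["googlebot"]:
        print(agent, "blocked" if blocked_entirely(agent) else "not blocked")
```

Because the * group is omitted here, an agent with no matching group (googlebot in the usage loop) is reported as not blocked at the root, which matches the * rules in the report: they disallow specific paths only.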

