
Robots Exclusion Standard data for jobrobot.de

Resource Scan

Scan Details

Site Domain jobrobot.de
Base Domain jobrobot.de
Scan Status Ok
Last Scan 2024-06-20T04:26:04+00:00
Next Scan 2024-07-20T04:26:04+00:00

Last Scan

Scanned 2024-06-20T04:26:04+00:00
URL https://jobrobot.de/robots.txt
Redirect https://www.jobrobot.de/robots.txt
Redirect Domain www.jobrobot.de
Redirect Base jobrobot.de
Domain IPs 185.28.158.141, 2a00:9e20:191::b91c:9e8d
Redirect IPs 185.28.158.141, 2a00:9e20:191::b91c:9e8d
Response IP 185.28.158.141
Found Yes
Hash dfef00438ca73f8939abfe11c150fa3f8ef511cba1caf46834831ca0e20b94f6
SimHash 6814c5d4d616

Groups

*

Rule Path
Disallow /kooperationen/
Disallow /stellenanzeigen-firmenlogos/
Disallow /logos-arbeitgeber/
Disallow /jobsuche_2022.php3
Disallow /jobsuche_2024_ohne_jobslots.php3
Disallow /protokoll_url.php3
Disallow /protokoll_cluster.php3
Disallow /show_job.php3
Disallow /get_job.php3
Disallow /content_0400_jobsuche_test.htm
Disallow /gifs/impressum1.gif
Disallow /gifs/impressum2.gif
Disallow /content_0200_stellengesuch.htm
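
As a rough illustration of how the rules in the * group behave, the sketch below rebuilds that group as robots.txt text and checks a few paths with Python's standard urllib.robotparser. The robots.txt text is reconstructed from the table above, so ordering and whitespace in the live file may differ. "ExampleBot" and the path /kooperationen/partner.htm are hypothetical names used only for the demonstration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group listed above (an assumption about the
# exact layout of the live file, not a copy of it).
WILDCARD_GROUP = """\
User-agent: *
Disallow: /kooperationen/
Disallow: /stellenanzeigen-firmenlogos/
Disallow: /logos-arbeitgeber/
Disallow: /jobsuche_2022.php3
Disallow: /jobsuche_2024_ohne_jobslots.php3
Disallow: /protokoll_url.php3
Disallow: /protokoll_cluster.php3
Disallow: /show_job.php3
Disallow: /get_job.php3
Disallow: /content_0400_jobsuche_test.htm
Disallow: /gifs/impressum1.gif
Disallow: /gifs/impressum2.gif
Disallow: /content_0200_stellengesuch.htm
"""

parser = RobotFileParser()
parser.parse(WILDCARD_GROUP.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, path)


# Disallow rules are matched as path prefixes.
for path in ("/", "/show_job.php3", "/kooperationen/partner.htm", "/kooperationen"):
    print(path, is_allowed(path))
```

Because matching is by prefix, /kooperationen/ blocks everything beneath that directory, while /kooperationen without the trailing slash is not covered by the rule.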

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3600

webmeasurement-bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /
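
The named groups above can be checked the same way. The following is a minimal sketch, assuming the groups are written out as ordinary robots.txt records. It uses Python's standard urllib.robotparser to report, per user agent, whether the site root may be fetched and which crawl delay applies. The helper name summarize is hypothetical, and the text is reconstructed from the listing rather than copied from the live file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the named groups listed above (layout is an assumption).
NAMED_GROUPS = """\
User-agent: slurp
Crawl-delay: 3600

User-agent: webmeasurement-bot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: blexbot
Disallow: /

User-agent: petalbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(NAMED_GROUPS.splitlines())


def summarize(agent: str) -> dict:
    """Hypothetical helper: root access and crawl delay for one agent."""
    return {
        "agent": agent,
        "root_allowed": parser.can_fetch(agent, "/"),
        "crawl_delay": parser.crawl_delay(agent),
    }


for agent in ("slurp", "webmeasurement-bot", "ahrefsbot", "blexbot", "petalbot"):
    print(summarize(agent))
```

In this sketch slurp may fetch any path but is asked to wait 3600 seconds between requests, while the other four agents are blocked from the entire site.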