elephant.se
robots.txt

Robots Exclusion Standard data for elephant.se

Resource Scan

Scan Details

Site Domain elephant.se
Base Domain elephant.se
Scan Status Ok
Last Scan2025-09-19T16:42:28+00:00
Next Scan 2025-09-26T16:42:28+00:00

Last Scan

Scanned2025-09-19T16:42:28+00:00
URL https://elephant.se/robots.txt
Domain IPs 2a02:2350:5:10e:63:922:c6ed:f9ca, 46.30.215.12
Response IP 46.30.215.12
Found Yes
Hash 6729bd5251b28b0c1b9ec27b03516d081f540b4e9437fa91b9cce89a144ba0f4
SimHash 6a131a5536c1

Groups

*

Rule Path
Disallow /archived_files/
Disallow /admin/
Disallow /admin/upload_picture.php
Disallow /js/
Disallow /tmp/
Disallow /private/
Disallow /images/
Disallow /books/
Disallow /members/
Disallow /logos/
Disallow /ads/
Disallow /inc/
Disallow /banner/
Disallow /icons/
Disallow /maps
Disallow /flags/
Disallow /photos/
Disallow /sounds/
Disallow /*.doc$
Disallow /*.inc$
Disallow /*.gif$
Disallow /*.jpg$
Disallow /*.jpeg$
Disallow /*.js$
Disallow /*.css$
Disallow /searchtools-rss.xml
Disallow /header.inc.php
Disallow /footer.inc.php

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap http://www.elephant.se/sitemap.xml

Comments

  • robots.txt for http://www.elephant.se/
  • Disallow: /*.htm$
  • Disallow: /*.html$
  • updated 2025-04-14 delay, before 2004: disallow rtestprob links