gep.com
robots.txt

Robots Exclusion Standard data for gep.com

Resource Scan

Scan Details

Site Domain gep.com
Base Domain gep.com
Scan Status Ok
Last Scan2024-06-05T03:08:26+00:00
Next Scan 2024-07-05T03:08:26+00:00

Last Scan

Scanned2024-06-05T03:08:26+00:00
URL https://gep.com/robots.txt
Redirect https://www.gep.com:443/robots.txt
Redirect Domain www.gep.com
Redirect Base gep.com
Domain IPs 13.248.188.103, 76.223.53.232
Redirect IPs 13.33.30.57, 13.33.30.71, 13.33.30.81, 13.33.30.92, 2600:9000:229f:6c00:1c:f167:7300:93a1, 2600:9000:229f:7800:1c:f167:7300:93a1, 2600:9000:229f:8600:1c:f167:7300:93a1, 2600:9000:229f:8800:1c:f167:7300:93a1, 2600:9000:229f:a00:1c:f167:7300:93a1, 2600:9000:229f:ac00:1c:f167:7300:93a1, 2600:9000:229f:b200:1c:f167:7300:93a1, 2600:9000:229f:da00:1c:f167:7300:93a1
Response IP 13.33.30.81
Found Yes
Hash 8046e4d0c3cd1a6ccf980968e85e3e358a432f196c16b010dcaf809134daa575
SimHash 081660a0cad0

Groups

*

Rule Path
Disallow /taxonomy/
Disallow /brochure-download/*
Disallow /includes/
Disallow /sem/
Disallow /campaign/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /Mailers/
Disallow /clp/
Disallow /banner/*
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Allow /themes/custom/geptheme/favicon.ico
Disallow /%3Cnolink%3E
Disallow /mind/blog/tag/*
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /content/
Disallow /node/*
Disallow /GEPBI/help/WebUser/WebHelp/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=user%2Fpassword%2F
Disallow /?q=user%2Fregister%2F
Disallow /?q=user%2Flogin%2F
Disallow /?q=user%2Flogout%2F
Disallow /media_colorbox/
Disallow /mind/blog/category/*
Disallow /access-denied
Disallow /login-box
Disallow /ext-url
Disallow /%20
Disallow /%2B
Disallow /it-it/newsroom*

blp_bbot/0.1

Rule Path
Disallow /

blp_bbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.gep.com/sitemap.xml

Comments

  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Block BLP BOT