gep.com
robots.txt

Robots Exclusion Standard data for gep.com

Resource Scan

Scan Details

Site Domain gep.com
Base Domain gep.com
Scan Status Ok
Last Scan 2024-11-02T07:10:32+00:00
Next Scan 2024-12-02T07:10:32+00:00

Last Scan

Scanned 2024-11-02T07:10:32+00:00
URL https://gep.com/robots.txt
Redirect https://www.gep.com:443/robots.txt
Redirect Domain www.gep.com
Redirect Base gep.com
Domain IPs 13.248.188.103, 76.223.53.232
Redirect IPs 13.33.183.116, 13.33.183.24, 13.33.183.63, 13.33.183.78, 2600:9000:2816:5200:1c:f167:7300:93a1, 2600:9000:2816:5e00:1c:f167:7300:93a1, 2600:9000:2816:800:1c:f167:7300:93a1, 2600:9000:2816:8400:1c:f167:7300:93a1, 2600:9000:2816:ba00:1c:f167:7300:93a1, 2600:9000:2816:d200:1c:f167:7300:93a1, 2600:9000:2816:da00:1c:f167:7300:93a1, 2600:9000:2816:f000:1c:f167:7300:93a1
Response IP 13.35.210.119
Found Yes
Hash 8046e4d0c3cd1a6ccf980968e85e3e358a432f196c16b010dcaf809134daa575
SimHash 081660a0cad0
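
The Hash field above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body and shows how such a digest could be used to detect a changed robots.txt between scans. The function names `sha256_hex` and `has_changed` are hypothetical, and the sample body is illustrative rather than the actual gep.com file.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return sha256_hex(body) != previous_hash


# Illustrative body only; not the real file served by gep.com.
sample = b"User-agent: *\nDisallow: /admin/\n"
digest = sha256_hex(sample)

print(len(digest))                         # 64 hex characters
print(has_changed(digest, sample))         # False: same bytes, same digest
print(has_changed(digest, sample + b"#"))  # True: any byte change alters it
```

An exact digest only answers "identical or not". The SimHash field presumably serves the complementary purpose of measuring near-duplicates, but its construction is not described in the report and is not reproduced here.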

Groups

*

Rule Path
Disallow /taxonomy/
Disallow /brochure-download/*
Disallow /includes/
Disallow /sem/
Disallow /campaign/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /Mailers/
Disallow /clp/
Disallow /banner/*
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Allow /themes/custom/geptheme/favicon.ico
Disallow /%3Cnolink%3E
Disallow /mind/blog/tag/*
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /content/
Disallow /node/*
Disallow /GEPBI/help/WebUser/WebHelp/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=user%2Fpassword%2F
Disallow /?q=user%2Fregister%2F
Disallow /?q=user%2Flogin%2F
Disallow /?q=user%2Flogout%2F
Disallow /media_colorbox/
Disallow /mind/blog/category/*
Disallow /access-denied
Disallow /login-box
Disallow /ext-url
Disallow /%20
Disallow /%2B
Disallow /it-it/newsroom*
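
The group above mixes a broad Disallow on /themes/ with a narrower Allow for the favicon, and several rules use the `*` wildcard. The following is a minimal sketch of how a crawler following RFC 9309 might evaluate such rules: the longest matching pattern wins, and Allow wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, only a handful of the listed rules are included, and percent-encoding normalization is deliberately left out.

```python
import re


def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    `*` matches any run of characters; a trailing `$` anchors the end.
    Without `$`, the pattern acts as a prefix match.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    regex = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    if anchored:
        regex += "$"
    return re.compile(regex)


def is_allowed(rules, path: str) -> bool:
    """Apply longest-match precedence; a path with no matching rule is allowed."""
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


# A subset of the rules listed in the `*` group.
RULES = [
    ("disallow", "/themes/"),
    ("allow", "/themes/custom/geptheme/favicon.ico"),
    ("disallow", "/node/*"),
    ("disallow", "/it-it/newsroom*"),
    ("disallow", "/user/login"),
]

print(is_allowed(RULES, "/themes/custom/geptheme/favicon.ico"))  # True
print(is_allowed(RULES, "/themes/custom/geptheme/style.css"))    # False
print(is_allowed(RULES, "/node/42"))                             # False
print(is_allowed(RULES, "/en/newsroom"))                         # True
```

Python's standard `urllib.robotparser` is not used here because it does not interpret `*` inside paths and applies rules in file order rather than by longest match, so it could judge the favicon rule differently.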

blp_bbot/0.1

Rule Path
Disallow /

blp_bbot

Rule Path
Disallow /
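
The report lists three groups: the catch-all `*` and two groups named for the BLP bot, each blocking the whole site. As a rough sketch of how a crawler might pick its group, the code below matches the group name case-insensitively against the crawler's product token and falls back to `*` when nothing matches. The function `select_group` and the token `examplebot` are hypothetical, and the `*` group is abbreviated to a single rule. Real parsers differ on how they treat a version suffix such as `/0.1`, so exact name matching is an assumption.

```python
def select_group(groups, product_token: str):
    """Return the rules for the group whose name equals the product token.

    Comparison is case-insensitive. If no named group matches, the rules of
    the `*` group are returned (or an empty list if there is none).
    """
    token = product_token.lower()
    for name, rules in groups.items():
        if name != "*" and name.lower() == token:
            return rules
    return groups.get("*", [])


# Group names as listed in the report; the `*` group is abbreviated.
GROUPS = {
    "*": [("disallow", "/admin/")],
    "blp_bbot/0.1": [("disallow", "/")],
    "blp_bbot": [("disallow", "/")],
}

print(select_group(GROUPS, "BLP_BBot"))    # [('disallow', '/')]
print(select_group(GROUPS, "examplebot"))  # [('disallow', '/admin/')]
```

Listing the bot both with and without the version suffix, as this file does, covers parsers that compare the full string as well as those that compare only the bare product token.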

Other Records

Field Value
sitemap https://www.gep.com/sitemap.xml

Comments

  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Block BLP BOT