triestecampus.com
robots.txt

Robots Exclusion Standard data for triestecampus.com

Resource Scan

Scan Details

Site Domain triestecampus.com
Base Domain triestecampus.com
Scan Status Ok
Last Scan 2025-06-25T03:04:58+00:00
Next Scan 2025-07-25T03:04:58+00:00

Last Scan

Scanned 2025-06-25T03:04:58+00:00
URL https://triestecampus.com/robots.txt
Redirect https://www.triestecampus.com/robots.txt
Redirect Domain www.triestecampus.com
Redirect Base triestecampus.com
Domain IPs 51.136.14.31
Redirect IPs 51.136.14.31
Response IP 51.136.14.31
Found Yes
Hash 34e8bf13e0b885073631a8ac8fd97e93ce63cce04c54ca0ff818178345bbbbaf
SimHash 2b3b9c4d0370

Groups

blp_bbot
businessdbbot
ccbot
covarioids
converacrawler
curl/
discobot
download ninja
email exractor
ezooms
fdm 3.x
flaxcrawler
grabber
grapeshot
gslfbot
heritrix
httrack
intelium_bot
istellabot
java/
larbin
lemurwebcrawler
libwww-perl
metamojicrawler
mj12bot
nutch
openacoon
php/
plukkie
proximic
python-urllib
ruby
seokicks
spbot
turnitinbot
yandexbot
wbsearchbot
weblexbot
wget
wire/0.
zyborg

Rule Path
Disallow /

msnbot/bingbot
bingbot
msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120
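
The two groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own method: ROBOTS_TXT is a hypothetical reconstruction that includes only three of the blocked agents, since the scan lists parsed records rather than the raw file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction (assumption): a subset of the agents listed
# above, written out in standard robots.txt syntax.
ROBOTS_TXT = """\
User-agent: wget
User-agent: mj12bot
User-agent: httrack
Disallow: /

User-agent: bingbot
User-agent: msnbot
Crawl-delay: 120
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

url = "https://www.triestecampus.com/"
wget_allowed = parser.can_fetch("wget", url)     # group with "Disallow: /"
bing_allowed = parser.can_fetch("bingbot", url)  # group with no path rules
bing_delay = parser.crawl_delay("bingbot")       # seconds between requests

print(wget_allowed, bing_allowed, bing_delay)  # False True 120
```

A group without path rules leaves every path allowed, so the only constraint on bingbot and msnbot here is the 120-second delay.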

*

Rule Path
Disallow /Admin/
Disallow /admin/
Disallow /hyve-admin
Disallow /hyve-admin/
Disallow /Auth/
Disallow /Auth$
Disallow /auth/
Disallow /Batch/
Disallow /batch/
Disallow /Custom/
Disallow /custom/
Disallow /Tmp/
Disallow /tmp/
Disallow /LazyLogin
Disallow /lazylogin
Disallow /ShowReel
Disallow /showreel
Disallow /Search/SearchCMS
Disallow /search/searchcms
Disallow /*%3Dprint%3Dtrue
Disallow /*%26ListMode%3D*
Disallow /*%26listmode%3D*
Disallow /*?ListMode=*
Disallow /*?listmode=*
Disallow /*%26SortMode%3D*
Disallow /*%26sortmode%3D*
Disallow /*?SortMode=*
Disallow /*?sortmode=*
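
Several rules above use the "*" wildcard and the "$" end anchor. The sketch below shows one way such patterns can be matched, assuming the RFC 9309 reading of those two characters. The helpers rule_to_regex and is_disallowed are hypothetical names, only four of the rules are included, Allow rules and longest-match precedence are ignored, and paths are compared as raw strings with no percent-encoding normalization.

```python
import re

# Illustrative subset of the rules listed above (not the full group).
RULES = ["/hyve-admin", "/Auth$", "/*?ListMode=*", "/*%3Dprint%3Dtrue"]

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Without "$", the pattern is a plain prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_disallowed(path: str, rules=RULES) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

for path in ["/hyve-admin/login", "/Auth", "/Authors",
             "/courses?ListMode=grid", "/courses?listmode=grid"]:
    print(path, is_disallowed(path))
```

Matching is case-sensitive, which is presumably why the file repeats most rules in two casings: with only the "ListMode" rule loaded, the lowercase "listmode" query above is not caught.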

Other Records

Field Value
sitemap https://www.triestecampus.com/sitemap-index.xml

Warnings

  • 1 invalid line.