tesiverified.it
robots.txt

Robots Exclusion Standard data for tesiverified.it

Resource Scan

Scan Details

Site Domain tesiverified.it
Base Domain tesiverified.it
Scan Status Ok
Last Scan 2024-10-27T07:59:48+00:00
Next Scan 2024-11-26T07:59:48+00:00

Last Scan

Scanned 2024-10-27T07:59:48+00:00
URL https://tesiverified.it/robots.txt
Domain IPs 104.21.84.74, 172.67.188.164, 2606:4700:3033::ac43:bca4, 2606:4700:3037::6815:544a
Response IP 172.67.188.164
Found Yes
Hash e6cd430996f5446edafe5852b07a4d0f3345f894b9cdd8ab365b0098ed2cca33
SimHash 41148463c794
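
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The scan data does not name the algorithm or say what exactly is hashed, so the sketch below only assumes SHA-256 over the raw response body; the function name body_digest and the sample bytes are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    # Assumption: the scanner hashes the raw robots.txt bytes with SHA-256.
    return hashlib.sha256(body).hexdigest()


# Illustrative input only, not the actual file served by the site.
sample = b"User-agent: mj12bot\nDisallow: /\n"
digest = body_digest(sample)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing such a digest between scans is one simple way to detect that the file changed, under the assumption stated above.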

Groups

ia_archiver

Rule Path
Disallow /curr/viewcurr.asp
Disallow /google-scholar-data/

*

Rule Path
Disallow /__PDF/
Disallow /v2/appunto-pdf.jsp*
Disallow /tesiteca_docs/
Disallow /inviotesi/
Disallow /tesiteca/
Disallow /logons/logon.asp?*
Disallow /consult/cart.jsp?*
Disallow /v3/pdf-js-viewer/web/viewerExtract?file=*
Disallow /v3/pdf-js-viewer/web/viewerPreview.html?file=*
Disallow /v3/pdf-js-viewer/web/viewer.html?file=%2FworksheetPdfPreview%2F*
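
Several rules in this group end in *. Under the wildcard convention described in RFC 9309, * matches any sequence of characters and a trailing $ anchors the end of the path; rules otherwise match as prefixes. The following is a minimal sketch of that matching, assuming the path is compared together with its query string. The function rule_matches and the sample request paths are hypothetical.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path plus query."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape every literal piece, then let each '*' match any characters.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start only, which gives prefix matching.
    return re.match(regex, path) is not None


print(rule_matches("/v2/appunto-pdf.jsp*", "/v2/appunto-pdf.jsp?id=1"))  # True
print(rule_matches("/logons/logon.asp?*", "/logons/logon.asp"))          # False
```

The second call returns False because the rule contains a literal ? before the wildcard, so a request without a query string is not covered by it.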

bingbot

Rule Path
Disallow /tesiteca/

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

boldbrains

Rule Path
Disallow /
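
To illustrate how the groups above interact, the sketch below rewrites a subset of them as robots.txt lines and queries them with Python's standard urllib.robotparser. The text is reconstructed from the listing, not copied from the live file, and the wildcard rules are left out because this parser uses plain prefix matching and does not interpret * inside paths. The user agent SomeBot and the request paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Subset of the groups listed above (wildcard rules omitted).
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /curr/viewcurr.asp
Disallow: /google-scholar-data/

User-agent: *
Disallow: /__PDF/
Disallow: /tesiteca_docs/
Disallow: /inviotesi/
Disallow: /tesiteca/

User-agent: bingbot
Disallow: /tesiteca/

User-agent: mj12bot
Disallow: /
"""

BASE = "https://tesiverified.it"


def build_parser(text: str) -> RobotFileParser:
    # parse() accepts the file as a list of lines; no network access is needed.
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

print(parser.can_fetch("mj12bot", BASE + "/"))           # False: blocked entirely
print(parser.can_fetch("bingbot", BASE + "/__PDF/x"))    # True: only its own group applies
print(parser.can_fetch("SomeBot", BASE + "/__PDF/x"))    # False: falls back to the * group
```

A crawler that has its own group follows only that group, which is why bingbot is not bound by the /__PDF/ rule listed under *.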

Other Records

Field Value
sitemap https://www.tesionline.it/sitemapindex.xml

Comments

  • Crawl-delay: 1
  • User-Agent: YahooSeeker/M1A1-R2D2
  • User-Agent: MSNBOT_Mobile
  • Allow: /mobile/
  • Disallow: /mobile/
  • User-agent: Yandex
  • Disallow: /