giventake.net
robots.txt

Robots Exclusion Standard data for giventake.net

Resource Scan

Scan Details

Site Domain giventake.net
Base Domain giventake.net
Scan Status Ok
Last Scan2024-10-29T22:10:13+00:00
Next Scan 2024-11-28T22:10:13+00:00

Last Scan

Scanned2024-10-29T22:10:13+00:00
URL https://giventake.net/robots.txt
Redirect https://www.giventake.net/robots.txt
Redirect Domain www.giventake.net
Redirect Base giventake.net
Domain IPs 104.21.85.131, 172.67.206.46, 2606:4700:3032::ac43:ce2e, 2606:4700:3037::6815:5583
Redirect IPs 104.21.85.131, 172.67.206.46, 2606:4700:3032::ac43:ce2e, 2606:4700:3037::6815:5583
Response IP 104.21.85.131
Found Yes
Hash 3d3349fd44e0de296da05d5f7ee0e95f6dfc613191411853c0c9545ba0ecb3af
SimHash fe601750ade8

Groups

barkrowler
blexbot
builtwith
dataforseobot
dataprovider
dotbot
gptbot
ioncrawl
megaindex.ru
mj12bot
mozilla/5.0 (x11; datanyze; linux x86_64) applewebkit/537.36 (khtml, like gecko) chrome/65.0.3325.181 safari/537.36
obot
semrushbot
surdotlybot
turnitinbot
turnitinbot
wellknownbot
zoombot
zoominfobot

Rule Path
Disallow /

*

Rule Path
Disallow /personalArea/
Disallow /*actionMessage
Disallow /*src%3Dfeed-atom
Disallow /*/agora/ad/
Disallow /*?*page&
Disallow /*?*page$
Disallow /*/communities/*/login
Disallow /*/communities/*/signup

Comments

  • this is robots.txt
  • block undesired crawlers
  • generic bots
  • Don't crawl personal area
  • Don't crawl action message
  • block fictive addresses
  • block settings attributes
  • Disallow: *setLang=*
  • Disallow: /*listings/*?*resource
  • Disallow: /*communities/*?*resource
  • block agora redirected pages
  • block non content changing parameters
  • Disallow: /*?*expanded
  • Disallow: /*?*canContinue
  • Disallow: /*?*imageSearch
  • block communities login page