treffisivut.fi
robots.txt

Robots Exclusion Standard data for treffisivut.fi

Resource Scan

Scan Details

Site Domain treffisivut.fi
Base Domain treffisivut.fi
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-11-14T10:58:27+00:00
Next Scan 2024-11-28T10:58:27+00:00

Last Successful Scan

Scanned 2024-10-07T10:57:28+00:00
URL https://www.treffisivut.fi/robots.txt
Domain IPs 104.21.65.133, 172.67.163.123, 2606:4700:3033::6815:4185, 2606:4700:3033::ac43:a37b
Response IP 172.67.163.123
Found Yes
Hash 25881f50515a6229ec12f7257849277b5452dbc0f92bd4cb1af42364de2519f9
SimHash a034dd7cefaf

Groups

*

Rule Path
Disallow /linkki/
Disallow /?s=
Disallow /search/
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-json/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /readme.html
Disallow */embed
Disallow */rss
Disallow */feed
Disallow /rss/
Disallow /feed/
Disallow /comments/feed/
Disallow */comments/
Disallow *?replytocom
Disallow /trackback/
Disallow /wp-trackback.php
Allow /wp-admin/admin-ajax.php
Allow /*.js*
Allow /*.css*
Allow /wp-includes/js/
Allow /wp-includes/css/
Allow /wp-includes/*.js*
Allow /wp-includes/*.css*
Allow /wp-content/*.js*
Allow /wp-content/*.css*
Allow /wp-content/uploads/
Allow /wp-content/plugins/*.js*
Allow /wp-content/plugins/*.css*
Allow /wp-content/themes/*.js*
Allow /wp-content/themes/*.css*
Allow /wp-content/themes/dating/js/
Allow /wp-content/themes/dating/css/
Allow /wp-content/themes/dating/fonts/
Allow /wp-content/themes/dating/images/
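
The group above mixes broad Disallow rules with narrower Allow rules, so the outcome for a given URL depends on how the crawler resolves overlaps. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). It uses only a small subset of the rules listed above; the names RULES, pattern_to_regex, and is_allowed are hypothetical helpers, not part of any library, and individual crawlers may behave differently.

```python
import re

# Subset of the "*" group listed above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/"),
    ("disallow", "*/feed"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-content/*.css*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
              "/wp-content/themes/dating/style.css", "/blog/feed"]:
        print(p, is_allowed(p))
```

Under this assumption, /wp-admin/admin-ajax.php stays crawlable even though /wp-admin/ is disallowed, because the Allow pattern is the longer match.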

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ahrefsbot/3.1

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ahrefsbot/2.0

Rule Path
Disallow /

ahrefs

Rule Path
Disallow /

ahrefs.com

Rule Path
Disallow /

http://ahrefs.com/robot/

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

xenu's

Rule Path
Disallow /

xenu's link sleuth 1.1c

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

tineye

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

archive.org bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /
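
The per-agent groups above each carry a single "Disallow /", which shuts the named crawler out of the whole site while other agents fall back to the "*" group. As a rough illustration, the sketch below feeds a small fragment to Python's standard urllib.robotparser. The fragment is a reconstruction from the groups listed in this report (only three of them, with one rule from the "*" group), not the fetched file itself, and the stdlib parser's agent matching is only one possible interpretation.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment based on the groups listed in this report.
# It is an assumption about the file's layout, not the original bytes.
FRAGMENT = """
User-agent: *
Disallow: /wp-admin/

User-agent: mj12bot
Disallow: /

User-agent: baiduspider
User-agent: baiduspider-video
User-agent: baiduspider-image
Disallow: /
""".splitlines()

parser = RobotFileParser()
parser.parse(FRAGMENT)

HOME = "https://www.treffisivut.fi/"


def report(agent):
    """Return whether `agent` may fetch the home page under the fragment."""
    return parser.can_fetch(agent, HOME)


if __name__ == "__main__":
    for agent in ["MJ12bot", "Baiduspider-image", "Googlebot"]:
        print(agent, report(agent))
```

Agent names are compared case-insensitively here, so "MJ12bot" matches the lowercased "mj12bot" group shown in the report.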

Comments

  • Optimized robots.txt
  • for www.treffisivut.fi
  • v1.0 2021-10-18
  • It is expressly forbidden to use spiders or other automated methods to access www.pornotuubi.com. Only if www.pornotuubi.com has given special permit such access is allowed.
  • Common list for most search engines
  • Block some of the most popular SEO agents
  • Block TinEye from crawling site
  • theTradeDesk
  • Block ia_archiver from crawling site
  • Block archive.org_bot from crawling site
  • Block Archive.org Bot from crawling site
  • Block ia_archiver-web.archive.org_bot from crawling site