trsondakika.com
robots.txt

Robots Exclusion Standard data for trsondakika.com

Resource Scan

Scan Details

Site Domain trsondakika.com
Base Domain trsondakika.com
Scan Status Ok
Last Scan2024-11-12T16:10:52+00:00
Next Scan 2024-11-19T16:10:52+00:00

Last Scan

Scanned2024-11-12T16:10:52+00:00
URL https://trsondakika.com/robots.txt
Domain IPs 104.21.90.86, 172.67.198.84, 2606:4700:3030::6815:5a56, 2606:4700:3030::ac43:c654
Response IP 104.21.90.86
Found Yes
Hash 34a5ddba87758c18048c5f01a4182862df9775bf1e2fe6e5538739b4802b5190
SimHash 8950221646a7

Groups

*

Rule Path
Disallow */xposts/

*

Rule Path
Disallow */reader/
Disallow */api/*

*

Rule Path
Disallow /embed/

*

Rule Path
Disallow /ads/

*

Rule Path
Disallow */editor/*
Disallow */posts/*/edit$

*

Rule Path
Disallow */posts/*/publish$

*

Rule Path
Disallow */singlepage/pipe/*
Disallow */singlepage/uncached_pipe/*
Disallow */singlepage/epipe/*

*

Rule Path
Disallow */reads/*/read$
Disallow */api/*

*

Rule Path
Disallow /matches/
Disallow */embed_code
Disallow /es/partidos/
Disallow /it/partite/
Disallow /de/spiele/
Disallow /zh-CN/*

*

Rule Path
Disallow /ping
Disallow */sessions
Disallow */admin/*
Disallow */management
Disallow */castr
Disallow */unfeature$
Disallow */unfeature/*

*

Rule Path
Disallow /videos/
Allow /channels/videos/

*

Rule Path
Disallow */teams/mainNavigationChevron_icon.svg?6173d8d1c141425871c45b245dfc6740
Disallow */leagues/mainNavigationChevron_icon.svg?6173d8d1c141425871c45b245dfc6740
Disallow /*?view_source=nav_bar&viewmedium=nav_bar_transfer*

twitterbot

Rule Path
Allow *

facebookbot

Rule Path
Allow *

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://trsondakika.com/sitemap.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • allow adsense crawler access to all pages