thisisuscrying.com
robots.txt

Robots Exclusion Standard data for thisisuscrying.com

Resource Scan

Scan Details

Site Domain thisisuscrying.com
Base Domain thisisuscrying.com
Scan Status Ok
Last Scan2025-10-20T05:46:26+00:00
Next Scan 2025-10-27T05:46:26+00:00

Last Scan

Scanned2025-10-20T05:46:26+00:00
URL https://thisisuscrying.com/robots.txt
Domain IPs 13.33.45.106, 13.33.45.118, 13.33.45.68, 13.33.45.75, 2600:9000:229f:2e00:b:9c0b:f980:93a1, 2600:9000:229f:5400:b:9c0b:f980:93a1, 2600:9000:229f:5a00:b:9c0b:f980:93a1, 2600:9000:229f:600:b:9c0b:f980:93a1, 2600:9000:229f:6200:b:9c0b:f980:93a1, 2600:9000:229f:8600:b:9c0b:f980:93a1, 2600:9000:229f:ae00:b:9c0b:f980:93a1, 2600:9000:229f:cc00:b:9c0b:f980:93a1
Response IP 13.33.45.75
Found Yes
Hash ee6a89e900f223fcecdd3705c8784e9400a57f585f21b84eaa672c75e271e0f3
SimHash ee254b0a09e2

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow */?*utm_source=*
Disallow */?*utm_campaign=*
Disallow */?*utm_medium=*
Disallow /*?source=*
Disallow */?*utm_newsbreak=*

*

Rule Path
Allow /ads.txt

*

Rule Path
Disallow *?embed*

*

Rule Path
Disallow */*a_aid%3D*
Disallow /*?partner=*

*

Rule Path
Disallow */*?mm-experiments=*

*

Rule Path
Disallow /*?*s=*

*

Rule Path
Disallow *?app=*

*

Rule Path
Disallow *?fbclid=*

*

Rule Path
Disallow /*?setLocale=*
Disallow *?georedirect=*

*

Rule Path
Disallow /*?term=*
Disallow *?ref=*
Disallow /*?view_source=*
Disallow /*?view_medium=*
Disallow /*?initialLeagueId=*

*

Rule Path
Disallow *_ga_*
Disallow */?_gl=*

*

Rule Path
Disallow */api/*

*

Rule Path
Disallow */videos/undefinedc_fill%2Cw_360%2Car_16%3A9%2Cf_auto%2Cq_auto%2Cg_auto/undefined
Disallow */teams/mainNavigationChevron_icon.svg?*
Disallow */leagues/mainNavigationChevron_icon.svg?*
Disallow */undefinedc_fill%2Cw_360%2Car_16%3A9%2Cf_auto%2Cq_auto%2Cg_auto/undefined

*

Rule Path
Disallow */files/*
Disallow */wp-admin/*
Disallow */?*utm_newsbreak=*
Disallow */wp-content/*
Disallow */wp-includes/*
Disallow */app/*
Disallow */embed_code*
Disallow */%7B%7Burl/*
Disallow */v2/*

twitterbot

Rule Path
Allow *

facebookbot

Rule Path
Allow *

*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://thisisuscrying.com/news-sitemap.xml
sitemap https://thisisuscrying.com/feed-sitemap.xml
sitemap https://thisisuscrying.com/sitemap.xml

Comments

  • Allow all search engines to crawl
  • Disallow Parameters
  • GA traffic source parameters
  • Allow crawling ads
  • Embedded widget parameters
  • Influencers/Affiliate Links parameters
  • Experiments testing team
  • Search box param
  • Apps param
  • FB campaigns
  • GEO targeting:
  • Unknown Parameters
  • GA parameters - Generated from Google caching
  • Generated from Voltax API url
  • Voltax HTML Unknown crawled urls
  • Generated from Fansided WP
  • Social Media Robots
  • Sitemap XML

Warnings

  • 2 invalid lines.