wheresthematch.com
robots.txt

Robots Exclusion Standard data for wheresthematch.com

Resource Scan

Scan Details

Site Domain wheresthematch.com
Base Domain wheresthematch.com
Scan Status Ok
Last Scan2026-01-29T04:06:13+00:00
Next Scan 2026-02-05T04:06:13+00:00

Last Scan

Scanned2026-01-29T04:06:13+00:00
URL https://wheresthematch.com/robots.txt
Redirect https://www.wheresthematch.com/robots.txt
Redirect Domain www.wheresthematch.com
Redirect Base wheresthematch.com
Domain IPs 77.68.93.186
Redirect IPs 77.68.93.186
Response IP 77.68.93.186
Found Yes
Hash e4da183aaa5d3eb6371248e9fa0f2c05964a6b3e743f0425e9009b5ef36d813c
SimHash 02549d0644b5

Groups

*

Rule Path
Disallow /live-sport-on-tv/?paging=true

mediapartners-google

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.wheresthematch.com/sitemap.xml

Comments

  • Robots file for www.wheresthematch.com site
  • Apply rules to all user agents
  • Folder exclusions
  • Folder exclusions DE
  • Sitemap file
  • AI and LLM access
  • AI: /llms.txt