sg.match.com
robots.txt

Robots Exclusion Standard data for sg.match.com

Resource Scan

Scan Details

Site Domain sg.match.com
Base Domain match.com
Scan Status Ok
Last Scan 2025-10-08T08:08:18+00:00
Next Scan 2025-10-15T08:08:18+00:00

Last Scan

Scanned 2025-10-08T08:08:18+00:00
URL https://sg.match.com/robots.txt
Domain IPs 208.83.240.49
Response IP 208.83.242.49
Found Yes
Hash 6e7b6c4353aa6272adb3726e91a9ad74ef12048eb8e344bb7501d08c6be5016d
SimHash 567f7021c191

Groups

*

Rule Path
Disallow /redalert
Disallow /bin
Disallow /html
Disallow /keynote
Disallow /api/
Disallow /subscribe
Disallow /help/contactus.aspx
Disallow /201*
Disallow /ajax
Disallow /authent
Disallow /dailymatches
Disallow /fullsignup
Disallow /home
Disallow /index.php
Disallow /mailbox
Disallow /messenger
Disallow /myaccount
Disallow /scheduler.php
Disallow /search
Disallow /signup
Disallow *?Profile=*
Disallow *?display_ccform=*
Disallow *?dtd_id=*
Disallow *?id_itw=*
Disallow *?pg=*
Disallow *?query=*
Disallow *?ref=*
Disallow *?v=*
Disallow /apida/
Disallow /apimm/
Disallow /d/
Disallow /m/
Disallow */misc/
Disallow */wp-content/
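
The `*` group above mixes plain prefixes (such as /search) with wildcard patterns (such as /201* and *?ref=*). The following is a minimal sketch of how such patterns are commonly interpreted, assuming the usual convention that `*` matches any run of characters, a trailing `$` anchors the end, and matching starts at the beginning of the path plus query string. The function names and the SAMPLE_RULES subset are illustrative only and are not part of the scan output.

```python
import re


def rule_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL; everything else is matched literally.
    """
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path, disallow_patterns):
    """Return True if any pattern matches from the start of url_path."""
    return any(rule_to_regex(p).match(url_path) for p in disallow_patterns)


# Hypothetical subset of the Disallow rules listed for the '*' group.
SAMPLE_RULES = ["/api/", "/201*", "/search", "*?ref=*", "*/wp-content/"]

if __name__ == "__main__":
    for path in ["/search/results", "/api", "/landing?ref=abc", "/about"]:
        print(path, is_disallowed(path, SAMPLE_RULES))
```

Note that a rule such as /search is a prefix, so it would also cover /searching, while /api/ does not cover /api because the trailing slash is part of the pattern. Python's standard `urllib.robotparser` does not expand `*` wildcards, which is why this sketch translates the patterns by hand.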

mediapartners-google

Rule Path
Allow /p/nw/
Disallow /
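
The mediapartners-google group pairs an Allow with a blanket Disallow, so the outcome depends on rule precedence. Below is a minimal sketch assuming the longest-match convention described in RFC 9309, where the most specific matching rule wins and Allow wins a tie. It handles plain prefixes only (no wildcards), and the names `is_allowed` and `MEDIAPARTNERS_RULES` are hypothetical.

```python
def is_allowed(url_path, rules):
    """Decide whether url_path may be crawled.

    rules is a list of (directive, path_prefix) pairs. Assumption: the
    longest matching prefix wins, and Allow beats Disallow on equal length.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if not url_path.startswith(prefix):
            continue
        rule_allows = directive.lower() == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and rule_allows):
            best_len = len(prefix)
            allowed = rule_allows
    return allowed


# The two rules listed for the mediapartners-google group.
MEDIAPARTNERS_RULES = [("Allow", "/p/nw/"), ("Disallow", "/")]

if __name__ == "__main__":
    for path in ["/p/nw/page", "/p/", "/search"]:
        print(path, is_allowed(path, MEDIAPARTNERS_RULES))
```

Under that assumption, only URLs below /p/nw/ are open to this crawler; everything else falls back to the shorter Disallow / rule.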

adidxbot
baiduspider
ccbot
facebot
gptbot
httrack
ia_archiver
nerdybot
omniexplorer_bot
scoutjet
searchmetricsbot
turnitinbot
wget
yandex

Rule Path
Disallow /
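
The report lists three groups: the `*` default, mediapartners-google, and a shared group for the fourteen fully blocked agents. As a rough sketch of how a crawler might pick its group, the snippet below assumes case-insensitive exact matching on the product token with a fallback to `*`. Real crawlers may match more loosely, and the group labels and function name here are hypothetical.

```python
# User agents that share the final "Disallow /" group in the scan.
BLOCKED_AGENTS = {
    "adidxbot", "baiduspider", "ccbot", "facebot", "gptbot", "httrack",
    "ia_archiver", "nerdybot", "omniexplorer_bot", "scoutjet",
    "searchmetricsbot", "turnitinbot", "wget", "yandex",
}


def select_group(product_token):
    """Return a label for the group that applies to a crawler token.

    Assumption: tokens are compared case-insensitively and must match
    exactly; anything unrecognised falls back to the '*' group.
    """
    token = product_token.lower()
    if token == "mediapartners-google":
        return "mediapartners-google"
    if token in BLOCKED_AGENTS:
        return "blocked"
    return "*"


if __name__ == "__main__":
    for agent in ["GPTBot", "Mediapartners-Google", "Googlebot"]:
        print(agent, "->", select_group(agent))
```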