sg.match.com
robots.txt

Robots Exclusion Standard data for sg.match.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	sg.match.com
Base Domain	match.com
Scan Status	Ok
Last Scan	2025-10-08T08:08:18+00:00
Next Scan	2025-10-15T08:08:18+00:00

Last Scan

Scanned	2025-10-08T08:08:18+00:00
URL	https://sg.match.com/robots.txt
Domain IPs	208.83.240.49
Response IP	208.83.242.49
Found	Yes
Hash	6e7b6c4353aa6272adb3726e91a9ad74ef12048eb8e344bb7501d08c6be5016d
SimHash	567f7021c191

Groups

*

Rule	Path
Disallow	/redalert
Disallow	/bin
Disallow	/html
Disallow	/keynote
Disallow	/api/
Disallow	/subscribe
Disallow	/help/contactus.aspx
Disallow	/201*
Disallow	/ajax
Disallow	/authent
Disallow	/dailymatches
Disallow	/fullsignup
Disallow	/home
Disallow	/index.php
Disallow	/mailbox
Disallow	/messenger
Disallow	/myaccount
Disallow	/scheduler.php
Disallow	/search
Disallow	/signup
Disallow	?Profile=
Disallow	?display_ccform=
Disallow	?dtd_id=
Disallow	?id_itw=
Disallow	?pg=
Disallow	?query=
Disallow	?ref=
Disallow	?v=
Disallow	/apida/
Disallow	/apimm/
Disallow	/d/
Disallow	/m/
Disallow	*/misc/
Disallow	*/wp-content/

Rule

Path

Disallow

/redalert

Disallow

/bin

Disallow

/html

Disallow

/keynote

Disallow

/api/

Disallow

/subscribe

Disallow

/help/contactus.aspx

Disallow

/201*

Disallow

/ajax

Disallow

/authent

Disallow

/dailymatches

Disallow

/fullsignup

Disallow

/home

Disallow

/index.php

Disallow

/mailbox

Disallow

/messenger

Disallow

/myaccount

Disallow

/scheduler.php

Disallow

/search

Disallow

/signup

Disallow

*?Profile=*

Disallow

*?display_ccform=*

Disallow

*?dtd_id=*

Disallow

*?id_itw=*

Disallow

*?pg=*

Disallow

*?query=*

Disallow

*?ref=*

Disallow

*?v=*

Disallow

/apida/

Disallow

/apimm/

Disallow

/d/

Disallow

/m/

Disallow

*/misc/

Disallow

*/wp-content/

mediapartners-google

Rule	Path
Allow	/p/nw/
Disallow	/

Rule

Path

Allow

/p/nw/

Disallow

/

adidxbot
baiduspider
ccbot
facebot
gptbot
httrack
ia_archiver
nerdybot
omniexplorer_bot
scoutjet
searchmetricsbot
turnitinbot
wget
yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

sg.match.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

mediapartners-google

adidxbotbaiduspiderccbotfacebotgptbothttrackia_archivernerdybotomniexplorer_botscoutjetsearchmetricsbotturnitinbotwgetyandex

sg.match.com
robots.txt

adidxbot
baiduspider
ccbot
facebot
gptbot
httrack
ia_archiver
nerdybot
omniexplorer_bot
scoutjet
searchmetricsbot
turnitinbot
wget
yandex