socialresearchmethods.net
robots.txt

Robots Exclusion Standard data for socialresearchmethods.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	socialresearchmethods.net
Base Domain	socialresearchmethods.net
Scan Status	Ok
Last Scan	2024-09-22T00:56:33+00:00
Next Scan	2024-10-22T00:56:33+00:00

Last Scan

Scanned	2024-09-22T00:56:33+00:00
URL	https://socialresearchmethods.net/robots.txt
Redirect	https://conjointly.com/robots.txt
Redirect Domain	conjointly.com
Redirect Base	conjointly.com
Domain IPs	3.165.102.107, 3.165.102.52, 3.165.102.71, 3.165.102.73
Redirect IPs	108.156.133.15, 108.156.133.38, 108.156.133.69, 108.156.133.85
Response IP	108.156.133.38
Found	Yes
Hash	ebbea2400e322f567bcbd1de271a4f432174b9bb3f499428a0b0ce9991d5cd67
SimHash	6050181827b1

Groups

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Allow	/
Disallow	/booking-confirmed.html
Disallow	/booking-confirmed/
Disallow	/googled979b0cc4477a394.html
Disallow	/googled979b0cc4477a394/
Disallow	/analysis/

Rule

Path

Allow

/

Disallow

/booking-confirmed.html

Disallow

/booking-confirmed/

Disallow

/googled979b0cc4477a394.html

Disallow

/googled979b0cc4477a394/

Disallow

/analysis/

Back to top

Other Records

Field	Value
sitemap	https://conjointly.com/sitemap.xml
sitemap	https://conjointly.com/image-sitemap.xml

Field

Value

sitemap

https://conjointly.com/sitemap.xml

sitemap

https://conjointly.com/image-sitemap.xml

Back to top

Comments

Disallow data scraping and usage of website content for AI model training or prompting.
Explicit opt-out from certain crawlers is not an invitation for others to train AI models on our content.
Data scraping and model training must be opt-in, not opt-out.
Demand consent, credit, and compensation.
CreateDontScrape

Back to top

Warnings

`host` is not a known field.

Back to top

socialresearchmethods.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

gptbot

ccbot

google-extended

claudebot

*

Other Records

Comments

Warnings

socialresearchmethods.net
robots.txt