sift.com
robots.txt

Robots Exclusion Standard data for sift.com

Resource Scan

Scan Details

Site Domain	sift.com
Base Domain	sift.com
Scan Status	Ok
Last Scan	2025-10-24T02:33:44+00:00
Next Scan	2025-11-23T02:33:44+00:00

Last Scan

Scanned	2025-10-24T02:33:44+00:00
URL	https://sift.com/robots.txt
Domain IPs	23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4
Response IP	23.185.0.4
Found	Yes
Hash	265ddfcbb0de08cc959238b11a38b6adf9807e49be3412b0bc1e6ffa473c2b87
SimHash	40d90a00e51e

Groups

*

Rule	Path
Disallow	/?s=
Disallow	/page/*/?s=
Disallow	/search/
Disallow	/wp-json/
Disallow	/?rest_route=
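
The five Disallow paths above include one wildcard pattern (`/page/*/?s=`). A minimal sketch of how a crawler might evaluate them, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a prefix match. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and because this group contains only Disallow rules, no Allow/Disallow precedence logic is included.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = ["/?s=", "/page/*/?s=", "/search/", "/wp-json/", "/?rest_route="]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL; all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_allowed(path_and_query: str) -> bool:
    """Return False if any Disallow pattern matches the start of the path."""
    return not any(rule.match(path_and_query) for rule in RULES)
```

Under these assumptions, `is_allowed("/page/2/?s=fraud")` is false while `is_allowed("/page/2/")` is true, since only the search-result variant of a paginated URL matches a rule.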

slackbot

Rule	Path
Allow	/

chatgpt-user

Rule	Path
Allow	/

claudebot

Rule	Path
Allow	/

perplexitybot

Rule	Path
Allow	/

google-extended

Rule	Path
Allow	/

gptbot

Rule	Path
Allow	/

ccbot

Rule	Path
Allow	/

googlebot

Rule	Path
Allow	/

bingbot

Rule	Path
Allow	/

applebot

Rule	Path
Allow	/

anthropicbot

Rule	Path
Allow	/

xai-bot

Rule	Path
Allow	/

coherebot

Rule	Path
Allow	/

llamabot

Rule	Path
Allow	/

mistralbot

Rule	Path
Allow	/

Other Records

Field	Value
sitemap	https://sift.com/sitemap_index.xml
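
The groups and the sitemap record can be read back with Python's standard `urllib.robotparser`. The sketch below parses an abbreviated, hypothetical reconstruction of the file: the directive order and casing are assumptions, only two of the named groups are included, and `ExampleBot` is a made-up agent used to show the fallback to the `*` group. This parser appears to do plain prefix matching, so the `*` inside `/page/*/?s=` is probably not treated as a wildcard here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical, abbreviated reconstruction of the scanned robots.txt.
# The rule values come from the tables in this report; the layout is assumed.
ROBOTS_TXT = """\
User-agent: *
Disallow: /?s=
Disallow: /page/*/?s=
Disallow: /search/
Disallow: /wp-json/
Disallow: /?rest_route=

User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

Sitemap: https://sift.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Sitemap URLs declared in the file (site_maps() requires Python 3.8+).
sitemaps = parser.site_maps()

# A named group takes priority over "*", so GPTBot may fetch /search/.
gptbot_ok = parser.can_fetch("GPTBot", "https://sift.com/search/")

# An agent without its own group falls back to the "*" group's Disallow rules.
other_ok = parser.can_fetch("ExampleBot", "https://sift.com/search/")
```

The point of the comparison is that the per-bot `Allow /` groups override the `*` group entirely for those agents, so the search and REST paths are blocked only for crawlers that are not listed by name.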

Comments

START YOAST BLOCK
---------------------------
---------------------------
END YOAST BLOCK
Sitemap declaration
Allow All Major AI and Search Bots
