sift.com
robots.txt

Robots Exclusion Standard data for sift.com

Resource Scan

Scan Details

Site Domain sift.com
Base Domain sift.com
Scan Status Ok
Last Scan 2025-10-24T02:33:44+00:00
Next Scan 2025-11-23T02:33:44+00:00

Last Scan

Scanned 2025-10-24T02:33:44+00:00
URL https://sift.com/robots.txt
Domain IPs 23.185.0.4, 2620:12a:8000::4, 2620:12a:8001::4
Response IP 23.185.0.4
Found Yes
Hash 265ddfcbb0de08cc959238b11a38b6adf9807e49be3412b0bc1e6ffa473c2b87
SimHash 40d90a00e51e
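
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, though the scan does not name the algorithm. Assuming it is a SHA-256 of the raw response body, a fetched copy of the file could be compared against the recorded value roughly as follows. The function name and the choice to hash the raw bytes are assumptions made for illustration, not details taken from the scanner.

```python
import hashlib

# Hash value recorded in the scan above. Treating it as SHA-256 is an
# assumption: the scan does not state which algorithm produced it.
RECORDED_HASH = "265ddfcbb0de08cc959238b11a38b6adf9807e49be3412b0bc1e6ffa473c2b87"


def body_matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 hex digest of `body` equals `recorded`.

    `body` is assumed to be the unmodified bytes of the robots.txt response.
    """
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

A mismatch would mean either that the file changed after the scan or that the scanner hashes something other than the raw body.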

Groups

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/
Disallow /wp-json/
Disallow /?rest_route=

slackbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

google-extended

Rule Path
Allow /

gptbot

Rule Path
Allow /

ccbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

applebot

Rule Path
Allow /

anthropicbot

Rule Path
Allow /

xai-bot

Rule Path
Allow /

coherebot

Rule Path
Allow /

llamabot

Rule Path
Allow /

mistralbot

Rule Path
Allow /
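
The groups above can be read as follows: a crawler that matches one of the named groups follows only that group's rules, and every other crawler falls back to the `*` group. The sketch below illustrates this with the longest-match evaluation described in RFC 9309. It is a minimal illustration, not the scanner's or any crawler's actual implementation. It assumes the caller passes a bare product token (such as `gptbot`) instead of a full User-Agent header, and the names `DEFAULT_GROUP`, `NAMED_GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# Rules of the "*" group, as listed in the scan.
DEFAULT_GROUP = [
    ("disallow", "/?s="),
    ("disallow", "/page/*/?s="),
    ("disallow", "/search/"),
    ("disallow", "/wp-json/"),
    ("disallow", "/?rest_route="),
]

# Every named group in the scan carries the single rule "Allow /".
NAMED_GROUPS = {
    name: [("allow", "/")]
    for name in (
        "slackbot", "chatgpt-user", "claudebot", "perplexitybot",
        "google-extended", "gptbot", "ccbot", "googlebot", "bingbot",
        "applebot", "anthropicbot", "xai-bot", "coherebot", "llamabot",
        "mistralbot",
    )
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(product_token: str, path: str) -> bool:
    """Longest matching rule wins; Allow wins ties; no match means allowed."""
    # Simplification: exact, case-insensitive lookup of the product token.
    rules = NAMED_GROUPS.get(product_token.lower(), DEFAULT_GROUP)
    best_len, best_kind = -1, "allow"
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and kind == "allow"
            if longer or tie_for_allow:
                best_len, best_kind = len(pattern), kind
    return best_kind == "allow"
```

Under these assumptions, `is_allowed("somecrawler", "/search/fraud")` and `is_allowed("somecrawler", "/page/2/?s=fraud")` return `False`, while `is_allowed("gptbot", "/search/fraud")` returns `True`, because the `gptbot` group contains only `Allow /` and does not inherit the `*` group's Disallow rules.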

Other Records

Field Value
sitemap https://sift.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK
  • Sitemap declaration
  • Allow All Major AI and Search Bots