tpcc.org
robots.txt

Robots Exclusion Standard data for tpcc.org

Resource Scan

Scan Details

Site Domain tpcc.org
Base Domain tpcc.org
Scan Status Ok
Last Scan2025-10-05T14:21:06+00:00
Next Scan 2025-10-19T14:21:06+00:00

Last Scan

Scanned2025-10-05T14:21:06+00:00
URL https://tpcc.org/robots.txt
Domain IPs 104.26.8.171, 104.26.9.171, 172.67.71.174, 2606:4700:20::681a:8ab, 2606:4700:20::681a:9ab, 2606:4700:20::ac43:47ae
Response IP 104.26.8.171
Found Yes
Hash cefb507e676371e3cfb20ebdb34efdf31bbb8663fce3737ca9d685dca7baeb2b
SimHash 491048984712

Groups

*

Rule Path
Disallow /page/541?returnurl=%252fmyaccount
Disallow /page/564
Disallow /page/564
Disallow /myaccount
Disallow /giving-history
Disallow /contribution-statement
Disallow /page/576
Disallow /page/577
Disallow /page/578
Disallow /page/579
Disallow /page/580
Disallow /Login
Disallow /NewAccount
Disallow /ForgotUserName
Disallow /DISC
Disallow /page/558
Disallow /BaptismRequest
Disallow /GroupRequest
Disallow /Benevolence-Request
Disallow /join-group
Disallow /ServeRequest
Disallow /GiveRequest
Disallow /page/120
Disallow /kiosk
Disallow /checkin*
Disallow /page/446
Disallow /GetFile.ashx?*
Disallow /serve-saturday*
Disallow /community*
Disallow /serve-near-calendar*
Disallow /page/2928
Disallow /page/2635*
Disallow /page/2888*

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://tpcc.org/sitemap.xml