Clear sensitive memory without getting optimized out (revival of #636) #1579

theStack · 2024-08-06T21:10:30Z

This PR picks up #636 (which in turn picked up #448, so this is take number three) and is essentially a rebase on master.

Some changes to the original PR:

the clearing function now has the secp256k1_ prefix again, since the related helper _memczero got it as well (see PR Don't use reserved identifiers memczero and benchmark_verify_t #835 / commit e89278f)
the original commit b17a7df ("Make _set_fe_int( . , 0 ) set magnitude to 0") is not needed anymore, since it was already applied in PR VERIFY_CHECK precondition for secp256k1_fe_set_int. #943 (commit d49011f)
clearing of stack memory with secp256k1_memclear is now also done on modules that have been newly introduced since then, i.e. schnorr and ellswift (of course, there is still no guarantee that all places where clearing is necessary are covered)

So far I haven't looked at any disassembly and possible performance implications yet (there were some concerns expressed in #636 (comment)), happy to go deeper there if this gets Concept ACKed.

The proposed method of using a memory barrier to prevent optimizating away the memset is still used in BoringSSL (where it was originally picked up from) and in the Linux Kernel, see e.g. https://github.com/google/boringssl/blob/5af122c3dfc163b5d1859f1f450756e8e320a142/crypto/mem.c#L335 and https://github.com/torvalds/linux/blob/d4560686726f7a357922f300fc81f5964be8df04/include/linux/string.h#L348 / https://github.com/torvalds/linux/blob/d4560686726f7a357922f300fc81f5964be8df04/include/linux/compiler.h#L102

Fixes #185.

real-or-random

Concept ACK (obviously)

Thanks for reviving this, I never had the time/motivation to come back to this PR, but it's important.

We should call SECP256K1_CHECKMEM_UNDEFINE (

secp256k1/src/checkmem.h

Line 27 in b307614

* - SECP256K1_CHECKMEM_UNDEFINE(p, len):

) in secp256k1_memclear after clearing the memory. Reading cleared memory would clearly be a bug.

theStack · 2024-08-20T10:23:52Z

We should call SECP256K1_CHECKMEM_UNDEFINE (

secp256k1/src/checkmem.h

Line 27 in b307614

* - SECP256K1_CHECKMEM_UNDEFINE(p, len):

) in secp256k1_memclear after clearing the memory. Reading cleared memory would clearly be a bug.

Thanks, added that, and rebased on master.

real-or-random · 2024-10-08T12:10:54Z

@theStack Can you rebase this on top of musig which has introduced a few more code locations that need clearing?

Personally, I'd love to have this in the next release.

This code is not supposed to handle secret data.

theStack · 2024-10-08T22:51:17Z

@theStack Can you rebase this on top of musig which has introduced a few more code locations that need clearing?

Sure, done. With only five lines changed in the musig module, this needed less effort than expected, hope I didn't miss any instances (many of them are handled indirectly via the ..._{fe,scalar}_clear functions though).

Relevant excerpt of the range-diff (uncolored here)

$ git range-diff ac0e41...0b01d2
5:  6fcbae9 ! 15:  02ee811 Use secp256k1_memclear() to clear stack memory instead of memset()
    @@ src/modules/ellswift/main_impl.h: int secp256k1_ellswift_xdh(const secp256k1_con
          secp256k1_scalar_clear(&s);
      
     
    + ## src/modules/musig/session_impl.h ##
    +@@ src/modules/musig/session_impl.h: static void secp256k1_nonce_function_musig(secp256k1_scalar *k, const unsigned c
    +         secp256k1_scalar_set_b32(&k[i], buf, NULL);
    + 
    +         /* Attempt to erase secret data */
    +-        memset(buf, 0, sizeof(buf));
    +-        memset(&sha_tmp, 0, sizeof(sha_tmp));
    ++        secp256k1_memclear(buf, sizeof(buf));
    ++        secp256k1_memclear(&sha_tmp, sizeof(sha_tmp));
    +     }
    +-    memset(rand, 0, sizeof(rand));
    +-    memset(&sha, 0, sizeof(sha));
    ++    secp256k1_memclear(rand, sizeof(rand));
    ++    secp256k1_memclear(&sha, sizeof(sha));
    + }
    + 
    + int secp256k1_musig_nonce_gen_internal(const secp256k1_context* ctx, secp256k1_musig_secnonce *secnonce, secp256k1_musig_pubnonce *pubnonce, const unsigned char *input_nonce, const unsigned char *seckey, const secp256k1_pubkey *pubkey, const unsigned char *msg32, const secp256k1_musig_keyagg_cache *keyagg_cache, const unsigned char *extra_input32) {
    +@@ src/modules/musig/session_impl.h: int secp256k1_musig_nonce_gen_counter(const secp256k1_context* ctx, secp256k1_mu
    +     if (!secp256k1_musig_nonce_gen_internal(ctx, secnonce, pubnonce, buf, seckey, &pubkey, msg32, keyagg_cache, extra_input32)) {
    +         return 0;
    +     }
    +-    memset(seckey, 0, sizeof(seckey));
    ++    secp256k1_memclear(seckey, sizeof(seckey));
    +     return 1;
    + }
    + 
    +

real-or-random · 2024-10-08T22:59:45Z

This one comes to my mind, too:

secp256k1/src/util.h

Lines 253 to 254 in a88aa93

    
           /* acc may contain secret values. Try to explicitly clear it. */ 
        
           acc = 0;

theStack · 2024-10-08T23:16:40Z

This one comes to my mind, too:

secp256k1/src/util.h

Lines 253 to 254 in a88aa93

/* acc may contain secret values. Try to explicitly clear it. */

acc = 0;

Ah good catch, missed that (only grepped for memset calls). Tackled now as well.

real-or-random

If you want motivated, you could look at git grep secp256k1_ge_set_gej.*(. @sipa's recent comment in the MuSig PR really caught my attention. When I worked on the previous PR, I really didn't consider the possibility that a gej could leak secret data.

I wonder if it makes sense to have the _finalize functions in the hash module clear the state automatically. And then have "unsafe" funtions like _finalize_keep if the callers knows that data is public or if the caller wants to reuse the midstate. It sounds like a neat idea, but I'm not really convinced because it special-cases the hash module somewhat: we'll need to manually clear everything else including scalars, etc... So we have to be careful with this anyway when writing and reviewing code, and perhaps having yet another safety mechanism that works only for the hash module. (This could perhaps be a nice follow-up if we add the same mechanism to more modules, e.g., ge_set_gej could clear the gejs, unless you use a _keep variant.)

src/util.h

real-or-random · 2024-10-10T12:23:14Z

src/secp256k1.c

   for (i = 0; i <= counter; i++) {
       secp256k1_rfc6979_hmac_sha256_generate(&rng, nonce32, 32);
   }
   secp256k1_rfc6979_hmac_sha256_finalize(&rng);
+
+   secp256k1_memclear(keydata, sizeof(keydata));
+   secp256k1_rfc6979_hmac_sha256_clear(&rng);


We should add similar calls in other code locations where we derive secrets via hash functions, e.g., in the schnorrsig and musig modules (and ellswift? now sure). The previous PR predates even the schnorrsig module, see the commit message.

We should add similar calls in other code locations where we derive secrets via hash functions, e.g., in the schnorrsig and musig modules (and ellswift? now sure).

As far as I could see by greping for _finalize, all places where we derive secrets via hashing had corresponding hash object clearing calls already (probably I did that already in the first version of this PR, though I honestly don't remember). What I found though was that it was unnecessarily called in the schnorrsig challenge function, so I removed it again, as there are no secrets involved. For the musig module, the _memclear calls were replaced by _sha256_clear calls.

One place where I added a hash clearing call was in the public API function secp256k1_tagged_sha256. We don't use that internally, but it seems to make sense to clear the hash object, considering that we don't know if the user passes in secret data or not.

The previous PR predates even the schnorrsig module, see the commit message.

Right, removed that part of the commit message.

theStack · 2024-10-10T17:38:47Z

If you want motivated, you could look at git grep secp256k1_ge_set_gej.*(. @sipa's recent comment in the MuSig PR really caught my attention. When I worked on the previous PR, I really didn't consider the possibility that a gej could leak secret data.

I found the following functions containing _gej instances resulting from point multiplication (secp256k1_ecmult_{gen,const}) with secret scalars:

ECDH module: secp256k1_ecdh (res, resulting from the ECDH point multiplication; I think this should be cleared not only to avoid leaking the scalar, but even more so as it's a representation of the resulting shared secret)
MuSig2 module: secp256k1_musig_nonce_gen_internal (public nonce nonce_ptj in the for loop)
Schnorr module: secp256k1_schnorrsig_sign_internal (nonce commitment rj)
main library: secp256k1_ec_pubkey_create_helper (public point pj)

Taking care of those could be probably go into an own PR, as its trivial to fix and review and hence has a significantly higher chance to land in the next release than this one? (and having a version where the compiler still might optimize it out seems still much better than not doing it at all)

Interestingly, the ECDSA signing function does clear out the nonce commitment for both the jacobi and affine points (though the latter wouldn't be needed according to #1479 (comment)).

I wonder if it makes sense to have the _finalize functions in the hash module clear the state automatically. And then have "unsafe" funtions like _finalize_keep if the callers knows that data is public or if the caller wants to reuse the midstate. It sounds like a neat idea, but I'm not really convinced because it special-cases the hash module somewhat: we'll need to manually clear everything else including scalars, etc... So we have to be careful with this anyway when writing and reviewing code, and perhaps having yet another safety mechanism that works only for the hash module. (This could perhaps be a nice follow-up if we add the same mechanism to more modules, e.g., ge_set_gej could clear the gejs, unless you use a _keep variant.)

Sounds like a good idea to me for a follow-up.

real-or-random · 2024-10-10T20:55:15Z

Taking care of those could be probably go into an own PR, as its trivial to fix and review and hence has a significantly higher chance to land in the next release than this one? (and having a version where the compiler still might optimize it out seems still much better than not doing it at all)

My guess is that what we do currently is useless on any modern compiler. I admit that I haven't looked at the compiler output, but I'd rather spend the time on resolving the problem.

I don't think that fixing these additional cases here will make it much more difficult to review the PR. And to be honest, while this is a great for defense in depth, we haven't deeply cared about this so far. It's not the end of the world if we need to wait a few more months. So I think it's good to add these cases to this PR here.

We rely on memset() and an __asm__ memory barrier where it's available or on SecureZeroMemory() on Windows. The fallback implementation uses a volatile function pointer to memset which the compiler is not clever enough to optimize.

There are two uses of the secp256k1_fe_clear() function that are now separated into these two functions in order to reflect the intent: 1) initializing the memory prior to being used -> converted to fe_set_int( . , 0 ) 2) zeroing the memory after being used such that no sensitive data remains. -> remains as fe_clear() In the latter case, 'magnitude' and 'normalized' need to be overwritten when VERIFY is enabled. Co-Authored-By: isle2983 <isle2983@yahoo.com>

Co-Authored-By: isle2983 <isle2983@yahoo.com> Co-Authored-By: Pieter Wuille <pieter.wuille@gmail.com>

All of the invocations of secp256k1_memclear() operate on stack memory and happen after the function is done with the memory object. This commit replaces existing memset() invocations and also adds secp256k1_memclear() to code locations where clearing was missing; there is no guarantee that this commit covers all code locations where clearing is necessary. Co-Authored-By: isle2983 <isle2983@yahoo.com>

This gives the caller more control about whether the state should be cleaned (= should be considered secret). Moreover, it gives the caller the possibility to clean a hash struct without finalizing it.

Quoting sipa (see bitcoin-core#1479 (comment)): "When performing an EC multiplication A = aG for secret a, the resulting _affine_ coordinates of A are presumed to not leak information about a (ECDLP), but the same is not necessarily true for the Jacobian coordinates that come out of our multiplication algorithm." For the ECDH point multiplication result, the result in Jacobi coordinates should be cleared not only to avoid leaking the scalar, but even more so as it's a representation of the resulting shared secret.

theStack · 2024-10-11T16:29:59Z

My guess is that what we do currently is useless on any modern compiler. I admit that I haven't looked at the compiler output, but I'd rather spend the time on resolving the problem.

I don't think that fixing these additional cases here will make it much more difficult to review the PR. And to be honest, while this is a great for defense in depth, we haven't deeply cared about this so far. It's not the end of the world if we need to wait a few more months. So I think it's good to add these cases to this PR here.

Thanks for the feedback! Makes sense, added an extra commit at the end for the gej clearing after point multiplications, it's only four lines of code anyway. Also went over the necessary hash clearing places and made a few small changes (see #1579 (comment)).

theStack force-pushed the revival_of_pr636_cleanse branch from deebc69 to abb3be2 Compare August 6, 2024 21:19

real-or-random added the side-channel label Aug 17, 2024

real-or-random reviewed Aug 17, 2024

View reviewed changes

theStack force-pushed the revival_of_pr636_cleanse branch from abb3be2 to ac0e41b Compare August 20, 2024 10:13

Don't clear secrets in pippenger implementation

ec60c24

This code is not supposed to handle secret data.

theStack force-pushed the revival_of_pr636_cleanse branch from ac0e41b to 0b01d2a Compare October 8, 2024 22:51

theStack force-pushed the revival_of_pr636_cleanse branch from 0b01d2a to 05a46b5 Compare October 8, 2024 23:14

real-or-random reviewed Oct 10, 2024

View reviewed changes

real-or-random added the needs-changelog label Oct 10, 2024

real-or-random and others added 8 commits October 11, 2024 16:26

Add secp256k1_memclear() for clearing secret data

767ac09

We rely on memset() and an __asm__ memory barrier where it's available or on SecureZeroMemory() on Windows. The fallback implementation uses a volatile function pointer to memset which the compiler is not clever enough to optimize.

Separate between clearing memory and setting to zero in tests

59e2202

Co-Authored-By: isle2983 <isle2983@yahoo.com> Co-Authored-By: Pieter Wuille <pieter.wuille@gmail.com>

Implement various _clear() functions with secp256k1_memclear()

b8f4eb0

Don't rely on memset to set signed integers to 0

4f0e4d1

Introduce separate _clear functions for hash module

3d88249

This gives the caller more control about whether the state should be cleaned (= should be considered secret). Moreover, it gives the caller the possibility to clean a hash struct without finalizing it.

theStack force-pushed the revival_of_pr636_cleanse branch from 05a46b5 to c921078 Compare October 11, 2024 16:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clear sensitive memory without getting optimized out (revival of #636) #1579

Clear sensitive memory without getting optimized out (revival of #636) #1579

theStack commented Aug 6, 2024

real-or-random left a comment •

edited

Loading

theStack commented Aug 20, 2024

real-or-random commented Oct 8, 2024

theStack commented Oct 8, 2024

real-or-random commented Oct 8, 2024

theStack commented Oct 8, 2024

real-or-random left a comment

real-or-random Oct 10, 2024

theStack Oct 11, 2024

theStack commented Oct 10, 2024

real-or-random commented Oct 10, 2024 •

edited

Loading

theStack commented Oct 11, 2024

Clear sensitive memory without getting optimized out (revival of #636) #1579

Are you sure you want to change the base?

Clear sensitive memory without getting optimized out (revival of #636) #1579

Conversation

theStack commented Aug 6, 2024

real-or-random left a comment • edited Loading

Choose a reason for hiding this comment

theStack commented Aug 20, 2024

real-or-random commented Oct 8, 2024

theStack commented Oct 8, 2024

real-or-random commented Oct 8, 2024

theStack commented Oct 8, 2024

real-or-random left a comment

Choose a reason for hiding this comment

real-or-random Oct 10, 2024

Choose a reason for hiding this comment

theStack Oct 11, 2024

Choose a reason for hiding this comment

theStack commented Oct 10, 2024

real-or-random commented Oct 10, 2024 • edited Loading

theStack commented Oct 11, 2024

real-or-random left a comment •

edited

Loading

real-or-random commented Oct 10, 2024 •

edited

Loading