Understanding the hash-to-curve Function

It remains to take a look at what the hash_to_curve function defined in the src/hash module is doing exactly:

use ark_bls12_381::{G1Affine, G1Projective};
use ark_crypto_primitives::crh::{
    pedersen::{Window, CRH},
    CRH as CRHScheme,
};
use rand::SeedableRng;
use rand_chacha::ChaCha20Rng;

// --snip--

#[derive(Clone)]
struct ZkHackPedersenWindow {}

impl Window for ZkHackPedersenWindow {
    const WINDOW_SIZE: usize = 1;
    const NUM_WINDOWS: usize = 256;
}

pub fn hash_to_curve(msg: &[u8]) -> (Vec<u8>, G1Affine) {
    let rng_pedersen = &mut ChaCha20Rng::from_seed([
        1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
        1, 1,
    ]);
    let parameters = CRH::<G1Projective, ZkHackPedersenWindow>::setup(rng_pedersen).unwrap();
    let b2hash = blake2s_simd::blake2s(msg);
    (
        b2hash.as_bytes().to_vec(),
        CRH::<G1Projective, ZkHackPedersenWindow>::evaluate(&parameters, b2hash.as_bytes())
            .unwrap(),
    )
}

This function first initializes the pseudorandom number generator ChaCha20 with a 32-byte seed and feeds this RNG to the setup function. We look up the setup function in the crypto_primitives::crh::pedersen submodule (how do we know where to look? we check the use statement which brings CRH into scope at the beginning of the src/hash.rs file) and arrive here. Documentation is nonexistent so we jump to the code. Here are the relevant lines:

pub struct Parameters<C: ProjectiveCurve> {
    pub generators: Vec<Vec<C>>,
}

pub struct CRH<C: ProjectiveCurve, W: Window> {
    group: PhantomData<C>,
    window: PhantomData<W>,
}

impl<C: ProjectiveCurve, W: Window> CRHTrait for CRH<C, W> {
    const INPUT_SIZE_BITS: usize = W::WINDOW_SIZE * W::NUM_WINDOWS;
    type Output = C::Affine;
    type Parameters = Parameters<C>;

    fn setup<R: Rng>(rng: &mut R) -> Result<Self::Parameters, Error> {
        // ...
        let generators = Self::create_generators(rng);
        // ...
        Ok(Self::Parameters { generators })
    }

    // ...
}

impl<C: ProjectiveCurve, W: Window> CRH<C, W> {
    pub fn create_generators<R: Rng>(rng: &mut R) -> Vec<Vec<C>> {
        let mut generators_powers = Vec::new();
        for _ in 0..W::NUM_WINDOWS {
            generators_powers.push(Self::generator_powers(W::WINDOW_SIZE, rng));
        }
        generators_powers
    }

    pub fn generator_powers<R: Rng>(num_powers: usize, rng: &mut R) -> Vec<C> {
        let mut cur_gen_powers = Vec::with_capacity(num_powers);
        let mut base = C::rand(rng);
        for _ in 0..num_powers {
            cur_gen_powers.push(base);
            base.double_in_place();
        }
        cur_gen_powers
    }
}

Each invocation of generator_powers draws a random group element $B \leftarrow_{$} G_{1}$ and returns the vector $(B, 2 B, \dots, 2^{w - 1} B)$ where $w ≅$ W::WINDOW_SIZE. This function is called $n ≅$ NUM_WINDOWS times by create_generators which then returns a vector $((B_{0}, \dots, 2^{w - 1} B_{0}), \dots, (B_{n - 1}, \dots, 2^{w - 1} B_{n - 1}))$ where $B_{0}, \dots, B_{n - 1}$ are random group elements. In hash_to_curve, this function is called with constants WINDOW_SIZE = 1 and NUM_WINDOWS = 256 as defined in the implementation of trait Window for struct ZkHackPedersenWindow. Hence, the line

let parameters = CRH::<G1Projective, ZkHackPedersenWindow>::setup(rng_pedersen).unwrap();

defines a Parameters<G1Projective> struct whose field generators holds a tuple of 256 random group elements $(B_{0}, \dots, B_{255})$ of type G1Projective.

Then, the message is hashed with hash function BLAKE2s and the result is passed to the evaluate function, whose core is as follows:

    fn evaluate(parameters: &Self::Parameters, input: &[u8]) -> Result<Self::Output, Error> {
        // ...

        // Compute sum of h_i^{m_i} for all i.
        let bits = bytes_to_bits(input);
        let result = cfg_chunks!(bits, W::WINDOW_SIZE)
            .zip(&parameters.generators)
            .map(|(bits, generator_powers)| {
                let mut encoded = C::zero();
                for (bit, base) in bits.iter().zip(generator_powers.iter()) {
                    if *bit {
                        encoded += base;
                    }
                }
                encoded
            })
            .sum::<C>();

        // ...

        Ok(result.into())
    }

First, the input is converted into a vector of booleans $(b_{0}, \dots, b_{ℓ - 1})$ using the bytes_to_bits function from the pedersen module. Then, it is split into $n ≅$ NUM_WINDOWS chunks of size $w ≅$ WINDOW_SIZE and zipped with parameters.generators which contains the points $((B_{0}, \dots, 2^{w - 1} B_{0}), \dots, (B_{n - 1}, \dots, 2^{w - 1} B_{n - 1}))$ returned by setup. The closure inside map takes a chunk of bits $(b_{0}, \dots, b_{w - 1})$ and a vector of points $(B, \dots, 2^{w - 1} B)$ and returns $i = 0 \sum w - 1 b_{i} 2^{i} B = βB$ where $β : = \sum_{i = 0}^{w - 1} b_{i} 2^{i}$ is the integer whose bit representation is $(b_{0}, \dots, b_{w - 1}) .$ The final value of result is the sum over the $n$ windows of the output of this closure, i.e., $j = 0 \sum n - 1 i = 0 \sum w - 1 b_{w j + i} 2^{i} B_{j} = j = 0 \sum n - 1 β_{j} B_{j}$ where $β_{j} = \sum_{i = 0}^{w - 1} b_{w j + i} 2^{i}$ is the integer corresponding to the $j$ -th chunk of bits of the input.

In the specific case of hash_to_curve, we have $w = 1$ and $n = 256.$ Hence, if we let $(B_{0}, \dots, B_{255})$ denote the 256 group elements returned by setup and $h = (h_{0}, \dots, h_{255}) : = blakes2s (m)$ denote the output of the BLAKE2s hash function applied to message $m,$ seen as a vector of bits, then the hash_to_curve function applied to $m$ returns the point on $G_{1}$ defined by $H (m) : = j = 0 \sum 255 h_{j} B_{j} .$

Hence, $H$ can be seen as the composition of BLAKE2s and an instance of Pedersen hashing. Since both BLAKE2s and Pedersen hashing are collision-resistant (assuming hardness of the discrete logarithm problem for Pedersen hashing), $H$ is collision-resistant as well. Is it sufficient to make BLS signatures secure though?

Now that we understand all parts of the code, we can get down to solving the puzzle.