DiceCTF 2024 - rps casino

Writeup for the rps-casino Cryptography challenge.

This problem costed me a good 20 hours because of some stupid z3 gimmick. I think if not for that, could have solved this problem in 2 hours at max, which would have let me to dedicate more time for a 3rd crypto solve smh ๐Ÿ˜ซ.

This is the program script:

#!/usr/local/bin/python

import os
from Crypto.Util.number import bytes_to_long

def LFSR():
  state = bytes_to_long(os.urandom(8))
  while 1:
    yield state & 0xf
    for i in range(4):
      bit = (state ^ (state >> 1) ^ (state >> 3) ^ (state >> 4)) & 1
      state = (state >> 1) | (bit << 63)

rng = LFSR()
n = 56

print(f"Let's play rock-paper-scissors! We'll give you {n} free games, but after that you'll have to beat me 50 times in a row to win. Good luck!")

rps = ["rock", "paper", "scissors", "rock"]
nums = []

for i in range(n):
  choice = next(rng) % 3
  inp = input("Choose rock, paper, or scissors: ")
  if inp not in rps:
    print("Invalid choice")

    exit(0)

  if inp == rps[choice]:
    print("Tie!")

  elif rps.index(inp, 1) - 1 == choice:
    print("You win!")
  else:
    print("You lose!")

for i in range(50):
  choice = next(rng) % 3
  inp = input("Choose rock, paper, or scissors: ")
  if inp not in rps:
    print("Invalid choice")
    break
    
  if rps.index(inp, 1) - 1 != choice:
    print("Better luck next time!")
    break

  else:
    print("You win!")

else:
  print(open("flag.txt").read())

The problem flow is as follows:

  • We have to play 56 rounds of rock-paper-scissors against the server. Whatever the result is in those rounds, it does not matter.
  • Then we have to play 50 rounds more and win all of them. If we are successful, flag will be printed.
  • In both these steps, the computerโ€™s move is generated through a function called LFSR.

Linear Feedback Shift Register or in short LFSR are random number generators where the output bits depend on some linear combinations of the previous bits. https://github.com/Tsumiiiiiiii/Tsumiiiiiiii.github.io/blob/main/content/posts/dice24/lfsr_pic.svg?raw=true Let us try to understand how an LCG works with the help of the above figure.

  • The bits s0s1โ€ฆs7  โˆ€siโˆˆ{0,1}s_0s_1โ€ฆs_7 \ \ \forall s_i \in \{0, 1\} represent the seedseed or the initial state of the LFSR. If seedseed is somehow compromised, the whole secrecy of the LFSR is lost.
  • Since there are 8 initial bits, this is a 8 bit8 \ bit LFSR.
  • As previously mentioned, the next bits (s8s9โ€ฆsnโ€ฆs_8s_{9}โ€ฆs_nโ€ฆ) depend on some linear combination of the previous bits.
  • XORXOR is the linear operation here(actually xorxor is not linear in terms of traditional math, but under some specific circumstances, it does become linear which we will see in a bit). s8=s6 โŠ• s5 โŠ•s4 โŠ•s0s_8 = s_6 \ \oplus \ s_5 \ \oplus s_4 \ \oplus s_0
  • Notice how s8s_8 depends only s6,s5,s4,s0s_6, s_5, s_4, s_0 and not all of the state bits? Those bits on whom the next state depends are called taps. Taps are a key element of any LFSR. Choosing mathematically secured taps is very important to ensure the security of LFSRs. Let us generalize the taps for the above LFSR: sn=snโˆ’2 โŠ• snโˆ’3 โŠ•snโˆ’4 โŠ•snโˆ’8   โˆ€n>7s_{n} = s_{n-2} \ \oplus \ s_{n-3} \ \oplus s_{n-4} \ \oplus s_{n-8} \ \ \ \forall n > 7.
  • There is a better representation of taps, which helps us model LFSR mathematically in a very convenient manner. Remember how I said xor is linear under specific circumstances? It is linear under addition mod 2. That is, xor is denoted simply as b0โŠ•b1=(b0+b1)modโ€‰โ€‰2b_0 \oplus b_1 = (b_0+b_1) \mod 2, where biโˆˆ{0,1}b_i \in \{0, 1\}. We can thus represent the above tap equation with sn=snโˆ’2+snโˆ’3+snโˆ’4+snโˆ’8modโ€‰โ€‰2s_{n} = s_{n-2} + s_{n-3} + s_{n-4} + s_{n-8} \mod 2. Under Z/Z2\mathbb{Z}/\mathbb{Z}2 this is known as the Feedback polynomial which for the given LFSR is 1+x4+x5+x6+x81+x^4+x^5+x^6+x^8.

Suppose we want to find s99784s_{99784} for the above LFSR. Do we need to run the simulation 9978499784 times to get that?

No. There is a very neat approach to this due to the matrix representation of LFSR. Consider this matrix:

(sksk+1sk+2โ‹ฎsk+nโˆ’1)โŸnext  states= (010โ‹ฏ0001โ‹ฏโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฑ1c0c1c2โ‹ฏcnโˆ’1)โŸcompanion  matrix (T)โ‹…(skโˆ’1sksk+1โ‹ฎsk+nโˆ’2)โŸprevious  states= (010โ‹ฏ0001โ‹ฏโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฑ1c0c1c2โ‹ฏcnโˆ’1)kโŸraised  to  power kโ‹…(s0s1s2โ‹ฎsnโˆ’1)โŸinitial  states \underbrace{ \begin{pmatrix} s_{k} \\ s_{k+1} \\ s_{k+2} \\ \vdots \\ s_{k+n-1} \end{pmatrix} }_{\text{next \ states}} = \ \underbrace{ \begin{pmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & \vdots \\ \vdots & \vdots & \vdots & \ddots & 1 \\ c_0 & c_1 & c_2 & \cdots & c_{n-1} \\ \end{pmatrix} }_{\text{companion \ matrix} \ (T)} \cdot \underbrace{ \begin{pmatrix} s_{k - 1} \\ s_{k} \\ s_{k+1} \\ \vdots \\ s_{k+n-2} \end{pmatrix} }_{\text{previous \ states}} = \ \underbrace{ \begin{pmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & \vdots \\ \vdots & \vdots & \vdots & \ddots & 1 \\ c_0 & c_1 & c_2 & \cdots & c_{n-1} \\ \end{pmatrix} ^ k }_{\text{raised \ to \ power} \ k} \cdot \underbrace{ \begin{pmatrix} s_{0} \\ s_{1} \\ s_{2} \\ \vdots \\ s_{n-1} \end{pmatrix} }_{\text{initial \ states}}

This is the generalized representation of a n bitn \ bit LFSR where ak,ak+1,ak+2,โ€ฆa_k, a_{k+1}, a_{k+2}, โ€ฆ are state bits. The square matrix is known as companion matrix. This is equivalent to a transition matrix and is fixed for a particular LFSR. The c0,c1,c2,โ€ฆ,cnโˆ’1c_0, c_1, c_2, โ€ฆ, c_{n-1} in the companion matrix represents the tap bits.

What would these matrices look like for the given LFSR?

(sksk+1sk+2โ‹ฎsk+7)= (010โ‹ฏ0001โ‹ฏโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฑ1c0c1c2โ‹ฏc7)โ‹…(skโˆ’1sksk+1โ‹ฎsk+6)= (010โ‹ฏ0001โ‹ฏโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฑ1c0c1c2โ‹ฏcnโˆ’1)kโ‹…(s0s1s2โ‹ฎs7) \begin{pmatrix} s_{k} \\ s_{k+1} \\ s_{k+2} \\ \vdots \\ s_{k+7} \end{pmatrix} = \ \begin{pmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & \vdots \\ \vdots & \vdots & \vdots & \ddots & 1 \\ c_0 & c_1 & c_2 & \cdots & c_{7} \\ \end{pmatrix} \cdot \begin{pmatrix} s_{k - 1} \\ s_{k} \\ s_{k+1} \\ \vdots \\ s_{k+6} \end{pmatrix} = \ \begin{pmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & \vdots \\ \vdots & \vdots & \vdots & \ddots & 1 \\ c_0 & c_1 & c_2 & \cdots & c_{n-1} \\ \end{pmatrix} ^ k \cdot \begin{pmatrix} s_{0} \\ s_{1} \\ s_{2} \\ \vdots \\ s_{7} \end{pmatrix}

(c0c1c2c3c4c5c6c7)= (01110001) \begin{pmatrix} c_0 & c_1 & c_2 & c_3 & c_4 & c_5 & c_6 & c_7 \end{pmatrix} = \ \begin{pmatrix} 0 & 1 & 1 & 1 & 0 & 0 & 0 & 1 \end{pmatrix}

The problem of finding s99784s_{99784} becomes trivial now.

(s99784s99785s99786โ‹ฎs99791)= T99784โ‹…(s0s1s2โ‹ฎs7) \begin{pmatrix} s_{99784} \\ s_{99785} \\ s_{99786} \\ \vdots \\ s_{99791} \end{pmatrix} = \ T ^ {99784} \cdot \begin{pmatrix} s_{0} \\ s_{1} \\ s_{2} \\ \vdots \\ s_{7} \end{pmatrix}

Given is a 64-bit LFSR with 3 tap positions. Every time a game is played, 4 states of the LFSR is combined and returned which is later used after modulo by 3. Simply put, the 4 lower bits of the LFSR are used modulo 3 as the opponentโ€™s move.

move = (state  &  15)  mod  3=((s3โ‰ช3)+(s2โ‰ช2)+(s1โ‰ช1)+s0)  mod  3=(s3โ‹…8+s2โ‹…4+s1โ‹…2+s0)  mod  3 \begin{aligned} move \ &= \ (state \ \ \& \ \ 15) \ \ &mod \ \ 3 \\ &= ((s_3 \ll 3) + (s_2 \ll 2) + (s_1 \ll 1) + s_0 ) \ \ &mod \ \ 3 \\ &= (s_3\cdot8 + s_2\cdot4 + s_1\cdot2 + s_0) \ \ &mod \ \ 3 \end{aligned}

This entire operation steps can be represented using matrix operations as follows:

(movekmovek+1movek+2movek+30โ‹ฎ0)=(sk+sk+1โ‹…2+sk+2โ‹…4+sk+3โ‹…8sk+4+sk+5โ‹…2+sk+6โ‹…4+sk+7โ‹…8sk+8+sk+9โ‹…2+sk+10โ‹…4+sk+11โ‹…8sk+12+sk+13โ‹…2+sk+14โ‹…4+sk+15โ‹…80โ‹ฎ0)=(12480000โ‹ฏ000001248โ‹ฏ0โ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฎโ‹ฑ000000000โ‹ฏ0)โŸfirst  4  rows  contain  the  block (1 2 4 8 ) at  different  positionsโ‹…Tkโ‹…(s0s1s2โ‹ฎs63) mod  3 \begin{aligned} \begin{pmatrix} move_k \\ move_{k+1} \\ move_{k+2} \\ move_{k+3} \\ 0 \\ \vdots \\ 0 \end{pmatrix} & = \begin{pmatrix} s_k+s_{k+1}\cdot2+s_{k+2}\cdot4+s_{k+3}\cdot8 \\ s_{k+4}+s_{k+5}\cdot2+s_{k+6}\cdot4+s_{k+7}\cdot8 \\ s_{k+8}+s_{k+9}\cdot2+s_{k+10}\cdot4+s_{k+11}\cdot8 \\ s_{k+12}+s_{k+13}\cdot2+s_{k+14}\cdot4+s_{k+15}\cdot8 \\ 0 \\ \vdots \\ 0 \end{pmatrix} \\ & = \underbrace{ \begin{pmatrix} 1 & 2 & 4 & 8 & 0 & 0 & 0 & 0 & \cdots & 0 \\ 0 & 0 & 0 & 0 & 1 & 2 & 4 & 8 & \cdots & 0 \\ \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \vdots & \ddots & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \cdots & 0 \\ \end{pmatrix} }_{\text{first \ 4 \ rows \ contain \ the \ block} \ (1 \ 2 \ 4 \ 8\ ) \ \text{at \ different \ positions}} \cdot T^k \cdot \begin{pmatrix} s_{0} \\ s_{1} \\ s_{2} \\ \vdots \\ s_{63} \end{pmatrix} \ mod \ \ 3 \end{aligned}

All operations previously were on Z/Z2\mathbb{Z}/\mathbb{Z}2. It now shifts to Z/Z16\mathbb{Z}/\mathbb{Z}16 and then to Z/Z3\mathbb{Z}/\mathbb{Z}3. This last conversion is what complicates everything.

My first thought was to convert this to a graph problem. We can do a dfs where nodes represent the moves and edges represent the transition operations from one move to another. We can prune the branches based on modulo values we received from the initial 56 games.

0
1
2
15
0
1
15
0
15
sโ‚
sโ‚‚
sโ‚‚
sโ‚‚
sโ‚‚
sโ‚ƒ
sโ‚ƒ
sโ‚ƒ
sโ‚„
sโ‚„
...
...
...
...
...
...
...
...
...
...

The problem with this approach was that it gave too many valid seedsseeds. I could not figure out a way to eliminate them and keep a single seed.

Every move in the game can be represented as a constraint. We can represent the moves of the first 56 games as constraints and use a constraint solver which would give valid configurations to satisfy our constraints. A very popular constraint solver is z3. We can represent the state as a bit-vector of 64 bits and add all the constraints.

s1โ‰ก1modโ€‰โ€‰3s2โ‰ก2modโ€‰โ€‰3s3โ‰ก0modโ€‰โ€‰3โ‹ฎs56โ‰ก1modโ€‰โ€‰3}these  are  constraints \begin{aligned} \begin{rcases} s_1 &\equiv 1 \mod 3 \\ s_2 &\equiv 2 \mod 3 \\ s_3 &\equiv 0 \mod 3 \\ \vdots \\ s_{56} &\equiv 1 \mod 3 \end{rcases} \text{these \ are \ constraints} \end{aligned}

In my solution, I represented each bit as a variable. But navigating z3 is โ€ฆ a bit sophisticated. I had to go through a lot of trial and error to make it work.

bits = [ BitVec('f_%s' % i, 4) for i in range(n - 1, -1, -1) ]

S = Solver()
for i in range(n):
  S.add(ULT(bits[i], 2))
  S.add(UGE(bits[i], 0))

for i in range(n - 8):
  assert len(bits) == n
  relevant = bits[-4:]
  assert(len(relevant) == 4)
  num = (relevant[0] << 3) + (relevant[1]  << 2) + (relevant[2]  << 1) + relevant[3]
  diff = num - vals[i]
  rem = URem(diff, 3)
  S.add(rem == 0)
  for j in range(4):
    first = bits[-1] + bits[-2] + bits[-4] + bits[-5]
    bits = [first & 1] + bits[:-1]
    
print(S.check())
Note
For some reason that I havenโ€™t figured out yet, BitVec('f_%s' % i, 1) did not work. But somehow BitVec('f_%s' % i, 4) worked properly.

Once we have the seedseed, we can win the next 50 games easily and claim the flag.

rng = LFSR(seed)

for _ in range(56):
  _ = next(rng)
  
moves = ["rock", "paper", "scissors"]

for _ in range(50):
  move = next(rng) % 3
  win = moves[(move + 1) % 3]
  recvuntil(sock, b': ')
  sendline(sock, win.encode())

print(recvline(sock))

dice{wow_u_must_be_extremely_lucky_91ff5a34}