This is an old version of the compendium, written July 28, 2019, 9:14 p.m. Changes made in this revision were made by EvenMF. View rendered version.

TTM4135: Applied Cryptography and Network Security

# Preface This compendium will attempt to explain the core concepts of the subject. Feel free to edit, add or remove content if you feel like it would enhance the content. # Introduction This section serves as an introduction to what the course is about, but some important concepts are introcduced. ## Definitions of fundamental concepts Let's start of with defining what this is all about. Information security : Minimizing vulnerabilities of information assets. From this we can define several other concepts. Vulnerability : Any weakness that could be exploited to violate a system or the information it contains. Information assets : Can include data, software and hardware, people and even buildings. Threat : A threat is a potential violation of security. All of these are so fundamental that you will most likely not be asked about this on the exam, this is just to provide some clarity when explaing more advanced topics. ## The CIA triad Traditional definitions of information security are based on three information security goals: Conﬁdentiality : Preserving authorised disclosure of information. Integrity : Preventing unauthorised (accidental or deliberate) modification or destruction of information. Availability : Ensuring resources are accessible when required by an authroised user. ## Passive and Active threats When concidering threats, there are two main types: passive and active. Passive threats : Passive threats do not alter information in the system. Such attacks may be eavesdropping and traffic analysis. Active threats : Active threats alter information in the system. Such attacks may be masquerade, replay, modification of messages, denial of service. ## Securit services and mechanisms Security service : A processing or communication service to give a specific kind of protection to system resources. Security mechanism : A method of implementing one or more security services. # Classical Encryption ## Exhaustive key search The most basic method of attack is exhaustive key search, sometimes called a brute-force attack, in which the adversary tries all possible keys. This attack cannot be prevented so all cryptosystems must have enough keys to make exhaustive search too difficult computationally. 128 bits is the samllest key size which would be acceptable to prevent exhaustive key search today. ## Kerckhoffs' Principle Kerckhoffs’ principle : The attacker has complete knowledge of the cipher (i.e the decryption key is the only thing unknown to the attacker. This one is asked about a lot. Make sure you know this one. ## Statistics of Natural Language Many of the classical ciphers can be broken by studying the statistics of the messages. If you have a reasonably long message written in English, it is very likely that the most used character will be E, N, T, I and so on. By studying the character frequency of an encrpyted message you can often try to guess which character is supposed to be which (it will of course largely depend on the cipher used, and may not work at all on some ciphers. ## Transposition ciphers These types of ciphers are mosly based on switching the positions of the letters that appear in the message. For a transposition cipher the distributions of plaintext and cihpertext are the same. ## Substitution ciphers These types of cihpers are mostly based on switching out individual characters by other characters. For example switching E with Z or T with C. You can figure out if a simple substitution cipher has been used if the character frequency looks similar to the standard english character frequency only that other letters are used. A simple substitution cipher has n! different keys, where n is the size of the alphabet. ## Vigenére cipher If aplaintext comes from a natural language, such as English, the Vigenére cipher can be expected to have a "flat" frequency distribution of characters. Cryptoanalyiss of the Vigenére cipher often uses autocorrelation in order ot identify the period (key length) ## Hill Cipher The Hill cipher is a historical cipher with the encryption equation C = KP mod n for k × k key matrix K and vectors C and P representing the ciphertext and plaintext. A fundamental weakness of the Hill cipher is that encryption is a linear function so a plaintext attack is easy. # Block Ciphers

Product cipher : A product cipher is a cryptosystem in which the encryption function is formed by applying (or composing) several sub-encryption functions. Iterated cipher : An encrpytion process that is divided into r similar rounds, the subencryption functions are all the same function, g, called the round function. Each key is dervied from the overall master key K. The keys are called round keys or subkeys and are derived from K using a process called the key schedule. Most modern ciphers ~~in this category, read more bel~~are in a special class of product ciphers known as iterated ciphers. ~~Product cipher~~ There are two types of iterated cihpers: Substitution-permutation network : A ~~product cipher is a cryptosystem in~~ n iterated cipher. The block length must allow~~hich the encryption function is formed by applying (or composing) several~~ each block to be split into sub-~~encryption function~~blocks. Substitution-permutation network : An iterated cipher. Takes a block of the plaintext and the key as inputs, and applies several rounds of permutations, known as substitution boxes(S-box) and permutation boxes (P-box). A __S-box__ substitutes sub-blocks of size l bits (its input) by another block of bits (its output). It can be thought of as a substitution cipher. A __P-box__ takes the output from the S-boxes of one round, permutes the bits and feed them into the S-boxes in the next round. It can be thought of as a transposition cipher. At each round, the __round key__ is combined with some operation such as XOR and permutation is then performed in a certain fashion.

Feistel cipher : An iterated cipher in which the round function swaps the two halves of the block and forms a new right hand half

## Data encryption standard (DES) - DES is a 16-round Feistel chiper with key length of 56 bits and data block length of 64 bits. - DES is no longer concidered secure.

### Triple-DES To increase the security of DES, the algorithm may be run multiple times. Two times would be the simplest, but is vulnerable to a _meet-in-the-middle_ attack. To counter this, three runs are needed. This is often implemented as Encrypt-Decrypt-Encrypt. This allows backwards compatibility with normal DES, by using the same key for all three steps: $$ E(K_1, D(K_1, E(K_1, X))) = E(K_1, X) $$ While 3DES takes three keys as parameters, using it with only two keys is enough to stop the meet-in-the-middle attack, and often good enough: $$ E(K_1, D(K_2, E(K_1, X))) $$ Some applications, like PGP and S/MIME, still use three keys with 3DES.

## AES - Symmetric key block cipher - 128-bit data block; 128-, 192- or 256-bit master key - Number of round, NR, is 10, 12 or 14 (for 128-, 192-, 256-bit keys) - Byte-based design - Structure is essentially a substitution-permutation network

Consists of four stages: Substitute bytes : Uses an S-box to perform a byte-by-byte substitution of the block ShiftRows : A simple permutation MixColumns : A substitution that makes use of arithmetic over $GF(2^8)$ AddRoundKey : A simple bitwise XOR of the current block with a portion of the expanded key Only the `AddRoundKey` stage makes use of the key. The other three stages provides confusion, diffusion and non-linearity, but no security in themselves. When decrypting with AES, the inverses of the three first stages are used. The `AddRoundKey` stage is the same, because $ A \oplus B \oplus B = A $.

## Modes of operation ### Electronic Codebook (ECB) Mode - Randomised: No - Padding: Required - Error propagation: Errors propagate within blocks - IV: None - Parallel encryption: Yes - Parallel decryption: Yes ### Cipher Block Chaining (CBC) Mode - Randomised: Yes - Padding: Required - Error propagation: Errors propagate within blocks and into specific bits of next block - IV: Must be random - Parallel encryption: No - Parallel decryption: Yes ### Counter (CTR) Mode - Randomised: Yes - Padding: Not required - Error propagation: Errors occur in specific bits of current block - IV: Nounce must be unique - Parallel encryption: Yes - Parallel decryption: Yes ### CMAC (CBC-based MAC) Integrity and confidentiality? ### CCM mode (Counter mode with CBC MAC) This mode combines CBC-MAC for authentication of all data (payload plus associated data) and CTR encryption for the payload. Authenticated encrpytion. # Randomness TRNG : A true random number generator is a physcial process which outputs each valid string indpenedently with equal probability. PRNG : A psuedo random number generator (PRNG) is a deterministic algorithm which approximates a TRNG. - TRNGs can be constructed from physical devices and used as seeds for PRNGs - PRNGs can be constructed from other primitives including block ciphers. - TRNGs cna be used to make unbreakable encryption via the one time pas. - PRNGs can be used as practical synchronous stream ciphers. ## One-time pad A stream cipher that provides perfect secrecy. The key is a random sequence of characters, all of them independantly generated. Each character in the key is used one time only. A non periodic binary synchronous stream cipher. ## Linear Feedback Shift Registers (LFSR) ## The A5 cipher - The A5 is a binary synchronous stream cipher applied today in most GSM mobile telephones. - The A5/1 alogirhtm uses three linear feedback shift registers (LFSRs) whose output is combined. - The three LFSRs are irregularly clocked which means that the overall output is non-linear. The effective key length is 54 bits. ## RC4 - A word-based stream cipher. - Simple and efficient for software implementation - Too weak to be used today. # Transport Layer Security (TSL)

Transport Layer Security (TSL) is one of the most widely used security services today. It's a general purpose service implemented as a set of protocols that rely on TCP. It can be implemented as part of the underlying protocol suite, or alternatively embedded in specific packages. TLS is designed to make use of TCP to provide a reliable end-to-end secure service. TLS consists of three important protocols: the Handshake Protocol, the Record Layer Protocol, and the Alert Protocol. ## The Handshake Protocol The Handshake Protocol is the most complex part of TLS. This protocol allows the server and client to authenticate each other and to negotiate an encryption and MAC algorithm and cryptographic keys to be used to protect the data sent in a TLS record. The Handshake Protocol is used before any application data is transmitted. The Hanshake Protocols consists of 4 different phases: Phase 1 : Establish security capabilities, including protocol version, session ID, cipher suite, compression method, and intitial random numbers. Phase 2 : Server may send certificate, key exchange, and request certificate. Server signals end of hello message phase. Phase 3 : Client sends certificate if requested. Client sends key exchange. Client may send certificate verification. Phase 4 : Change cipher suite and finish handshake protocol. ## The Record protocol The record protocol provides confidentiality and message integrity. The record protocol takes an application message to be transmitted, fragments the data into manageable blocks, optionally compresses the data, applies a MAC, encrypts, adds a header, and transmits the resulting unit in a TCP segment. Received data are decrypted, verified, decompressed, and reassembled before being delivered to higher-level users. ## The Alert Protocol The Alert Protocol is used to convey TLS-related alerts to the peer entity. As with other applications that use TLS, alert messages are compressed and encrypted, as specified by the current state. Each message in this protocol consists of two byes. The first byte takes the value warning or fatal to convey the severity of the message. If the level is fatal, TLS immediately terminates the connection. Other connections on the same session may continue, but no new connections on this session may be established. The second byte contains a code that indicates the specific alert. ## TLS Security - Different kinds of attacks: implementation errors, poor choice of cryptographic primitives, flaws in protocol. - Backward compatibility is a problem (downgrade attacks). - Several examples of the principle that “attacks only get better” over time. - TLS 1.3 simplifies the handshake. - TLS 1.3 adds new features (e.g. 0-RTT mode) which present new challenges. # IP Security - IPSec are located at the network layer. - IPsec Encapsulating Security PayLoad (ESP) protocol can operate either in transport mode or in tunnel mode. Transport mode : iphost-to-iphost security where protection covers to the payload of IP-data. Encrypts IP payload and any IPv6 extension headers following the ESP header. Tunnel mode : Network/gateway-to-network/gateway security. Protection for the entire IP-packet by adding outer IP-packet between the network gateways. - Internet Protocol security (IPsec) is a framework for ensuring secure communications over Internet Protocol (IP) networks. - It provides similar security services as TLS, but at a lower layer in the communications protocol stack. - Security can be added to both IPv4 and IPv6. - IP level security encompasses three functional areas: authentication, confidentiality, and key management. - The authentication mechanism assures that a received packet was, in fact, transmitted by the part identified as the source in the packet header. In addtition, this mechanism assures that the packet has not been altered in transit. - The confidentiality facility enables communicating nodes to encrypt messages to prevent eavesdropping by third parties. - The key management facility is concerned with the secure exchange of keys. ##Chapter 1 Data origin authentication : provides confirmation of the claimed source (origin) of a data unit (message) Entity authentication : provides confirmation of the claimed identity of an entity Non-repudiation : Ensures that you cannot hide the source (origin) of a data unit (message). Substitution : each character (or set of characters) is replaced by a different character (or set of characters) Transposition : the characters in the plaintext are mixed up with each other (permuted) ## Chapter 2 Confusion : This involves substitution to make the relationship between the key and ciphertext as complex as possible. Diffusion : This involves transformations that dissipate the statistical properties of the plaintext across the ciphertext. (A small change in plaintext should give a completely different ciphertext) Group generator : A group generator is a group element g that in the operation __$g^k \mod p$__ represents all the elements that are relatively prime to p. Consider for example the group G $\mathbb{Z}_5$. G consists of the elements {1, 2, 3, 4}. One or several of these group elements may be a group generator for G. A way to check this is to use __$g^k \mod 5$__ for the k group elements and see if they represent all the group elements. Example: $2^1 =2 \mod 5 = 2$ $2^2 =4 \mod 5 = 4$ $2^3 =8 \mod 5 = 3$ $2^4 =16 \mod 5 = 1$ 2 is a group generator for $\mathbb{Z}_5$ because all its elements ({1, 2, 3, 4}) are represented. Finite ﬁeld : placeholder ## Chapter 3 Electronic Code Book (ECB) Mode : The basic mode of a block cipher. Plaintext block $P$ is encrypted with key $K$ to produce ciphertext block $C_t$. Ciphertext block $C$ is decrypted with key $K$ to produce plaintext block $P_t$. ![ECB-image](https://image.ibb.co/f2nV0o/Capture.png) Cipher Block Chaining (CBC) Mode : A random initialization vector (IV) is chosen and sent together with the ciphertext blocks. $C_t=E(P_t\oplus C_{t-1},K)$ where $C_0=IV$ $P_t=D(C_t,K)\oplus C_{t-1}$ where $C_0=IV$ ![CBC-image](https://image.ibb.co/eCTnfo/Capture.png) CTR Mode : CTR is a synchronous stream cipher. The keystream is generated by encrypting successive values of a "counter", initialised using a nonce (randomly chosen value) N: $O_t=E(T_t,K)$, where $T_t=N||t$ is the concatenation of the nonce and block number t. Encryption: $C_t=O_t\oplus P_t$. Decryption: $P_t=O_t\oplus C_t$. ![CTR-image](https://image.ibb.co/mSz4RT/Capture.png) True Random Number Generator (TRNG) : is a physical process which outputs each valid string independently with equal probability Pseudo Random Number Generator (PRNG) : is a deterministic algorithm which approximates a TRNG. Message Authentication Code (MAC) : A message authentication code (MAC) is a cryptographic mechanism used for ensuring message integrity. A MAC tag should at least be of length $\log_2{I/R}$ where $I$ is a limit on how many invalid messages are detected before the key is changed and $R$ is the acceptable probability that a false message is accepted. Stream Cipher : Stream ciphers are characterised by the generation of a keystream using the short key and an initialisation value as input. Each element of the keystream is used successively to encrypt one or more ciphertext characters. Stream ciphers are usually symmetric key ciphers: sender and receiver share the same key and can generate the same keystream given the same initialisation value. Synchronous stream ciphers : The keystream is generated independently of the plaintext. Both sender and receiver need to generate the same keystream and synchronise on its usage. One Time Pad : The key is a random sequence of characters, all of them independently generated. Each character in the key is used one time only. The one-time pad provides perfect secrecy. Linear Feedback Shift Register : A LFSR is a common component in design of stream ciphers. ## Chapter 4 Factorisation problem : Given an integer of *m* bits, find its prime factors. Factorisation by trial division is an exponential time algorithm and is hopeless for numbers of a few hundred bits. A number of special purpose methods exist, which apply if the integer to be factorised has special properties. The best current general method is known as the number field sieve. The number field sieve is a sub-exponential time algorithm. Discrete logarithm problem : Let g be a generator of $\mathbb{Z}_p^*$ for a prime p. The discrete log problem over $\mathbb{Z}_p^*$ is: **given y in $\mathbb{Z}_p^*$ find x with y = $g^x$ mod p.** Big O-notation : placeholder Fermat test : *if* a number p is prime then $a^{p−1}$ mod p = 1 for all a with gcd(a, p) = 1. If we examine a number n and find that $a^{n−1}$ mod n $\neq$ 1 then we know that n is not prime. Miller-Rabin test : placeholder RSA Encryption equation : placeholder RSA Decryption equation : placeholder RSA Padding : placeholder Prime number theorem : placeholder Håstad's attack : placeholder

## Chapter 5 Generator of $\mathbb{Z}_p^*$ : placeholder Diffie-Hellman key exchange : placeholder Elgamal cryptosystem : placeholder Collision resistance : placeholder Second preimage resistance : placeholder One-wayness : placeholder Birthday paradox : placeholder HMAC : placeholder GCM Mode : placeholder ## Chapter 6 Digital signature : placeholder Existential forgery : placeholder Selective forgery : placeholder Digital Signature Algorithm (DSA) : placeholder Key predistribution : placeholder Session key distribution : placeholder Key agreement : placeholder Needham-Schroeder protocol : placeholder Kerberos : placeholder TODO: Scrape definitions from all exercises (currently they are just from exercise 1-6) # Ciphers

## Symmetric cipher (secret key cipher) encryption and decryption keys known only to sender and receiver. (DES) ## Asymmetric cipher (public key cipher) each participant has a public key and a private key, may allow both encryption and signatures. (RSA) ## Attacks ### Ciphertext only attack the cryptanalyst has available only the intercepted cipher text. ### Known plaintext attack the cryptanalyst knows a small amount of plaintext and its cipher text equivalent ### Chosen plaintext attack The cryptanalyst can obtain the cipher text equivalent of some plaintext which can be selected by the attacker, i.e the attacker has an "inside encryptor” available ### Chosen cipher text attack The cryptanalyst can obtain the plaintext equivalent of some cipher text which can be selected by the attacker, i.e. the attacker has an “inside decryptor” available. ## Historical ciphers - Caesar - Substitution - Vigenère. Caesar, but also uses a key in order to choose how many steps to shift the alphabet for each letter #### Key expansion Input : 16-byte key. Output : 176 bytes (44 words) The first four words are used in the initial AddRoundKey step. The next ten word-groups are used in the ten rounds of the cipher. ### RSA # Iterated ciphers - Encryption process divided into _r_ similar rounds - the sub encryption functions _g_ are the same for all rounds - Each key $K_i$ is derived from the overall master key K. The keys $K_i$ are called round keys or subkeys and are derived from K using a process called the key schedule. ## Encryption Given a plaintext block, P, a round function g and round keys $K_1, K_2, . . . ,K_r$ , the ciphertext block, C, is derived through r rounds as follows. $$W_{0} = P$$ $$W_{1} = g(W_{0}, K_{1})$$ $$W_{2} = g(W_{1}, K_{2})$$ $$.$$ $$.$$ $$.$$ $$W_{r} = g(W_{r−1}, K_{r})$$ $$C = W_{r}$$ # Pseudorandom number generation Generated numbers should be as random as possible. This is defined by two criteria: Uniform distribution : The frequency of occurence of ones and zeros should be approximately equal. Independence : It should not be possible to infer a subsequence from any other.