Fall 2025: Topics in Information-Theoretic Cryptography

Since the beginning of this year, I have been developing a course on “Topics in Information-Theoretic Cryptography”. Recently, the course was approved for Fall 2025. I’m very excited to share some research with undergraduate and graduate students! Below, I list some relevant information for the course.

Course Number and Title

ECE598DA: Topics in Information-Theoretic Cryptography

Description

In this course, we will study foundational and recent work on the use of information theory to design and analyze cryptographic protocols. We will begin by studying privacy attacks which motivate strong privacy and security definitions. Then, we will explore the basics of differential privacy and study some core works on zero-knowledge proofs. Finally, we will explore various applications, including watermarking of generative models.

Syllabus

Week 1: Introduction: motivations, one-time pad review, review of probability theory

Week 2: Attacks and Composition Theorems for Differential Privacy

Week 3: Standard Mechanisms for Differential Privacy

Week 4: Information-Theoretic Lower Bounds for Differential Privacy

Week 5: Differentially Private Statistical Estimation and Testing

Week 6: Zero-Knowledge Proofs

Week 7: Statistical Zero-Knowledge Proofs: Part I

Week 8: Statistical Zero-Knowledge Proofs: Part II

Week 9: Multi-Party Computation

Week 10: Multi-Party and Computational Differential Privacy

Week 11: Code-Based Cryptography: Part I

Week 12: Code-Based Cryptography: Part II

Week 13: More Applications

Watermarking of Generative Models
Proof Systems for Machine Learning
Bounded-Storage Cryptography
Quantum Cryptography

Week 14: Project Presentations

Watermarking Language Models

Lav Varshney and I recently released an IACR preprint on how to analyze unforgeable watermarking procedures for generative agents. Our approach relies on cryptographic techniques and computational entropy notions.

Abstract

In this work, we construct distortion-free and unforgeable watermarks for language models and generative agents. The watermarked output cannot be forged by an adversary, nor can it be removed by the adversary without significantly degrading model output quality. The watermarked output is also distortion-free: the watermarking algorithm does not noticeably change the quality of the model output and, without the public detection key, no efficient adversary can distinguish watermarked output from output that is not watermarked. The core of the watermarking schemes involves embedding a message and a publicly verifiable digital signature in the generated model output. The message and signature can be extracted during the detection phase and verified by any authorized entity that has a public key. We show that, assuming the standard cryptographic assumption of one-way functions, we can construct distortion-free and unforgeable watermark schemes. Our framework relies on analyzing the inaccessible entropy of the watermarking schemes based on computational entropy notions derived from the existence of one-way functions.

The Errors in Our Way

This blog post is written for a general audience.

In today’s digital age, where information flows across networks at lightning speed, ensuring data integrity is crucial. Whether it’s a message sent over a noisy communication channel, data stored in memory, or even a barcode scanned at the supermarket, errors can occur during transmission or retrieval. This is where error-correcting codes (ECC) come into play, enabling systems to detect and correct errors automatically.

What Are Error-Correcting Codes?

Error-correcting codes are algorithms used to encode and decode data in such a way that errors introduced during transmission or storage can be detected and, in many cases, corrected (very important distinction!). These codes add redundancy to the original data, allowing the receiver to recognize and fix errors without requiring retransmission.

Types of Error-Correcting Codes

There are two broad categories of ECC:

1. Block Codes

Block codes work by encoding a fixed block of data at a time. Each block is transformed into a longer block that includes extra bits for error detection and correction. Examples include:

Hamming Codes: Introduced by Richard Hamming in the 1950s, these codes can detect and correct single-bit errors.
Reed-Solomon Codes: Widely used in CDs, DVDs, QR codes, and deep-space communication, they are effective at correcting burst errors.
Bose-Chaudhuri-Hocquenghem (BCH) Codes: Used in wireless communication and storage devices for robust error correction.

2. Convolutional Codes

Unlike block codes, convolutional codes process data continuously by encoding each bit in the context of previous bits. They are commonly used in real-time communication, such as satellite and mobile phone signals. The Viterbi Algorithm is often used to decode convolutional codes efficiently [1].
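To make "encoding each bit in the context of previous bits" concrete, here is a minimal Python sketch of a rate-1/2 convolutional encoder. The generator taps (111 and 101, memory of two bits) and the function name conv_encode are illustrative assumptions, not taken from any particular standard, and decoding with the Viterbi algorithm is not shown.

```python
def conv_encode(bits):
    """Rate-1/2 convolutional encoder with assumed taps 111 and 101.

    Each input bit produces two output bits that depend on the current
    bit and on the two previous input bits held in the shift register.
    """
    s1, s2 = 0, 0  # shift register: the two most recent input bits
    out = []
    for u in bits:
        if u not in (0, 1):
            raise ValueError("input must be a sequence of 0/1 bits")
        out.append(u ^ s1 ^ s2)  # first output: taps 1 1 1
        out.append(u ^ s2)       # second output: taps 1 0 1
        s1, s2 = u, s1           # shift the register by one position
    return out


encoded = conv_encode([1, 0, 1, 1])
print(encoded)
```

Because every output pair depends on earlier inputs, a single corrupted bit is inconsistent with its neighbors, which is what a decoder exploits.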

How Do Error-Correcting Codes Work?

At a high level, ECC techniques follow these steps:

Encoding: The original data is transformed using an encoding algorithm that introduces redundancy.
Transmission or Storage: The encoded data is sent over a channel or stored in a medium where errors might occur.
Error Detection: At the receiver’s end, the received data is analyzed to identify errors using parity checks or syndrome decoding.
Error Correction: If errors are detected, the redundancy bits help reconstruct the original message.
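
The four steps above can be sketched with the simplest possible code, a 3-fold repetition code, in Python. This is only an illustration under assumptions: the function names are hypothetical, the "channel" flips bits at positions we choose by hand, and real systems use far more efficient codes than repetition.

```python
def repeat_encode(bits, n=3):
    """Encoding: add redundancy by repeating every bit n times."""
    return [b for b in bits for _ in range(n)]


def flip_bits(bits, positions):
    """Transmission: simulate a noisy channel by flipping chosen positions."""
    noisy = list(bits)
    for p in positions:
        noisy[p] ^= 1
    return noisy


def repeat_decode(received, n=3):
    """Detection and correction: majority vote inside each block of n bits."""
    decoded, error_detected = [], False
    for i in range(0, len(received), n):
        block = received[i:i + n]
        ones = sum(block)
        if 0 < ones < len(block):  # copies disagree, so an error occurred
            error_detected = True
        decoded.append(1 if 2 * ones > len(block) else 0)
    return decoded, error_detected


message = [1, 0, 1]
sent = repeat_encode(message)
received = flip_bits(sent, [1, 6])  # one flipped bit in two different blocks
decoded, error_detected = repeat_decode(received)
print(decoded, error_detected)
```

With n=3, one flipped bit per block is corrected; two flips in the same block would be outvoted the wrong way, which is why stronger codes are used in practice.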

Specific Examples of Encoding and Decoding Functions

Hamming Code (7,4) Encoding:

A simple example of Hamming code encoding involves a 4-bit data input (D1, D2, D3, D4) and generating three parity bits (P1, P2, P3) using the following formulas:

P1 = D1 ⊕ D2 ⊕ D4
P2 = D1 ⊕ D3 ⊕ D4
P3 = D2 ⊕ D3 ⊕ D4

The final 7-bit encoded message is: P1 P2 D1 P3 D2 D3 D4. See [3].
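
As a minimal sketch, the parity formulas and bit layout above translate directly into Python. The function name hamming74_encode is a hypothetical choice for this illustration.

```python
def hamming74_encode(data_bits):
    """Encode [D1, D2, D3, D4] into the codeword P1 P2 D1 P3 D2 D3 D4."""
    if len(data_bits) != 4 or any(b not in (0, 1) for b in data_bits):
        raise ValueError("expected exactly four bits, each 0 or 1")
    d1, d2, d3, d4 = data_bits
    p1 = d1 ^ d2 ^ d4  # parity over codeword positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # parity over codeword positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4  # parity over codeword positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]


print(hamming74_encode([1, 0, 1, 1]))  # [0, 1, 1, 0, 0, 1, 1]
```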

Hamming Code Decoding:

On the receiving end, the parity bits are recomputed and compared with the received parity bits to detect errors. The syndrome vector determines the error location, and if necessary, the erroneous bit is flipped to correct the error.
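
A minimal Python sketch of this decoding step, assuming the bit order P1 P2 D1 P3 D2 D3 D4 and at most one flipped bit per codeword. The function name hamming74_decode is hypothetical. With this layout, the three syndrome bits read as a binary number give the 1-based position of the flipped bit, and 0 means no error was detected.

```python
def hamming74_decode(received):
    """Correct up to one flipped bit and return (data_bits, error_position)."""
    r = list(received)
    if len(r) != 7 or any(b not in (0, 1) for b in r):
        raise ValueError("expected exactly seven bits, each 0 or 1")
    # Recompute each parity check over the positions it covers.
    s1 = r[0] ^ r[2] ^ r[4] ^ r[6]  # positions 1, 3, 5, 7
    s2 = r[1] ^ r[2] ^ r[5] ^ r[6]  # positions 2, 3, 6, 7
    s3 = r[3] ^ r[4] ^ r[5] ^ r[6]  # positions 4, 5, 6, 7
    error_position = s1 + 2 * s2 + 4 * s3  # 0 means all checks passed
    if error_position:
        r[error_position - 1] ^= 1  # flip the erroneous bit back
    return [r[2], r[4], r[5], r[6]], error_position


codeword = [0, 1, 1, 0, 0, 1, 1]  # encodes the data bits 1 0 1 1
corrupted = list(codeword)
corrupted[4] ^= 1                 # flip the bit at position 5
print(hamming74_decode(corrupted))  # ([1, 0, 1, 1], 5)
```

If two bits are flipped, this decoder will "correct" to the wrong codeword, which is why the plain (7,4) code is described as correcting single-bit errors only.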

Reed-Solomon Encoding and Decoding:

In Reed-Solomon coding, the data symbols are treated as the coefficients of a polynomial over a finite field. Redundant parity symbols are generated by dividing this polynomial by a fixed generator polynomial. Decoding computes syndromes from the received word and then an error-locator polynomial (e.g., with the Berlekamp-Massey algorithm) to find and correct the corrupted symbols [4].
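
The encoding half can be sketched in a few lines of Python. This is a toy under stated assumptions: it works over the prime field of integers modulo 929 with the element 3 generating the roots (deployed systems typically use GF(2^8) instead), all function names are hypothetical, and only error detection via syndromes is shown, not Berlekamp-Massey correction. Polynomials are lists of coefficients, highest degree first.

```python
P = 929    # assumed prime modulus defining the field
ALPHA = 3  # assumed field element whose powers are the generator's roots


def poly_mul(a, b):
    """Multiply two polynomials with coefficients modulo P."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] = (out[i + j] + x * y) % P
    return out


def generator_poly(n_parity):
    """g(x) = (x - ALPHA)(x - ALPHA^2)...(x - ALPHA^n_parity)."""
    g = [1]
    for i in range(1, n_parity + 1):
        g = poly_mul(g, [1, (-pow(ALPHA, i, P)) % P])
    return g


def poly_mod(dividend, divisor):
    """Remainder of polynomial long division by a monic divisor."""
    rem = list(dividend)
    for i in range(len(rem) - len(divisor) + 1):
        coef = rem[i]
        if coef:
            for j, d in enumerate(divisor):
                rem[i + j] = (rem[i + j] - coef * d) % P
    return rem[-(len(divisor) - 1):]


def rs_encode(message, n_parity):
    """Systematic encoding: the message followed by n_parity parity symbols."""
    remainder = poly_mod(list(message) + [0] * n_parity, generator_poly(n_parity))
    return list(message) + [(-r) % P for r in remainder]


def syndromes(word, n_parity):
    """Evaluate the word at each root of g(x); all zeros means no error seen."""
    result = []
    for i in range(1, n_parity + 1):
        x, acc = pow(ALPHA, i, P), 0
        for c in word:
            acc = (acc * x + c) % P  # Horner's rule
        result.append(acc)
    return result


codeword = rs_encode([3, 2, 1], 4)
print(codeword, syndromes(codeword, 4))
```

A valid codeword is a multiple of g(x), so every syndrome is zero; changing any symbol makes at least one syndrome nonzero, and those nonzero values are the input to the error-locator step.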

Binary Golay Code Encoding and Decoding:

The (23,12) binary Golay code is a linear block code that encodes 12 bits of data into a 23-bit codeword by adding 11 parity bits. Its minimum distance is 7, so it can correct up to 3 bit errors per codeword or, when used only for detection, detect up to 6 bit errors.

Encoding: The 12-bit input message is multiplied by a generator matrix to produce the 23-bit codeword.
Noisy Transmission: Errors may be introduced as the codeword travels through a noisy channel.
Decoding: The receiver computes a syndrome using the parity-check matrix and applies an efficient decoding algorithm to correct errors if they fall within the error correction capability.

The Golay code is used in deep-space communication and digital broadcasting [5].
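
As a rough Python sketch of the Golay encode and decode steps: instead of an explicit generator matrix, this uses the equivalent cyclic form with the generator polynomial 0xC75 (one of the two standard choices), and it decodes with a brute-force syndrome lookup table rather than the efficient algorithms used in practice. Function and variable names are hypothetical. Codewords and messages are stored as Python integers, one bit per coefficient.

```python
from itertools import combinations

GOLAY_POLY = 0xC75  # x^11 + x^10 + x^6 + x^5 + x^4 + x^2 + 1
N, K = 23, 12       # codeword length and message length in bits


def poly_remainder(value):
    """Remainder of a binary polynomial (bits of value) divided by GOLAY_POLY."""
    for shift in range(value.bit_length() - 12, -1, -1):
        if value & (1 << (shift + 11)):
            value ^= GOLAY_POLY << shift
    return value


def golay_encode(message):
    """Systematic encoding: 12 message bits followed by 11 parity bits."""
    if not 0 <= message < (1 << K):
        raise ValueError("message must fit in 12 bits")
    shifted = message << (N - K)
    return shifted | poly_remainder(shifted)


def build_syndrome_table():
    """Map the syndrome of every error pattern of weight 0..3 to that pattern."""
    table = {}
    for weight in range(4):
        for positions in combinations(range(N), weight):
            error = sum(1 << p for p in positions)
            table[poly_remainder(error)] = error
    return table


SYNDROME_TABLE = build_syndrome_table()


def golay_decode(received):
    """Correct up to 3 flipped bits and return the 12-bit message."""
    error = SYNDROME_TABLE[poly_remainder(received)]
    return (received ^ error) >> (N - K)


message = 0b101100111000
codeword = golay_encode(message)
noisy = codeword ^ (1 << 0) ^ (1 << 10) ^ (1 << 22)  # flip three bits
print(golay_decode(noisy) == message)
```

The table ends up with exactly 2^11 = 2048 entries, one per possible syndrome, which reflects the fact that this code is perfect: every 23-bit word lies within 3 bit flips of exactly one codeword.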

Applications of Error-Correcting Codes

Error-correcting codes are used in a variety of fields, including:

Digital Communications: Used in Wi-Fi, mobile networks (e.g., 4G, 5G), and satellite communications to ensure reliable data transmission.
Storage Devices: Hard drives, SSDs, and RAM use ECC to prevent data corruption.
Deep Space Communication: NASA and other space agencies rely on ECC to communicate with spacecraft over vast distances.
Barcodes and QR Codes: Reed-Solomon codes allow damaged or partially obscured barcodes to be accurately scanned.
Financial and Banking Systems: Help keep transaction data intact during transmission, and also appear in some cryptographic applications.

Conclusion

As technology advances, so does the need for more efficient error correction methods. Emerging fields such as quantum error correction are gaining traction, ensuring the reliability of quantum computing and communication [2]. AI-driven ECC algorithms are also being explored to improve real-time error detection and correction.

Error-correcting codes remain a fundamental component of modern computing and communications, safeguarding data integrity in an increasingly connected world. Whether you’re streaming a video, making a phone call, or sending a deep-space probe, ECC is (probably) silently working behind the scenes to keep your data accurate and reliable.

Some References

[1] https://en.wikipedia.org/wiki/Viterbi_decoder

[2] https://en.wikipedia.org/wiki/Quantum_error_correction

[3] https://en.wikipedia.org/wiki/Hamming(7,4)

[4] https://en.wikipedia.org/wiki/Reed%E2%80%93Solomon_error_correction

[5] https://en.wikipedia.org/wiki/Binary_Golay_code

Why (and How) Things Work

In Honor of David Blackwell