IPSC 2008 Problem H – Hidden Text

IPSC 2008

Problem H – Hidden Text

A common practice when displaying a document that contains sensitive information is to post-process it to make the sensitive parts not readable, for example by blurring or pixelizing them.

However, in many situations this practice is not sufficient. An attacker might be able to restore the scrambled part of the text, at least partially. That will be your job in this problem.

Problem specification

You are given two image files as input. The easy input is a clear image with black letters and white background, the hard input contains various kinds of noise as well. In both cases, the image has been post-processed to hide a part of the information it contains. Recover it, and submit an appropriate proof.

Blurring algorithm used

The blurred part of the image was produced using the following algorithm:

Pick a positive integer σ. We define

G = e-(r2+c2)∕(2σ2) r,c

Compute the values G_r,c for -3σ ≤ r,c ≤ 3σ. Let S = ∑ _{-3σ≤r,c≤3σ}G_r,c. Set H_r,c = G_r,c∕S.

For example, for σ = 1 you should get the following values: S = 6.279785, and the values H_r,c look as follows (with H_0,0 in the middle):

0.000020  0.000239  0.001073  0.001769  0.001073  0.000239  0.000020
0.000239  0.002917  0.013071  0.021551  0.013071  0.002917  0.000239
0.001073  0.013071  0.058582  0.096585  0.058582  0.013071  0.001073
0.001769  0.021551  0.096585  0.159241  0.096585  0.021551  0.001769
0.001073  0.013071  0.058582  0.096585  0.058582  0.013071  0.001073
0.000239  0.002917  0.013071  0.021551  0.013071  0.002917  0.000239
0.000020  0.000239  0.001073  0.001769  0.001073  0.000239  0.000020

To blur a greyscale image, first represent each pixel [pr,pc] as an integer I_pr,pc between 0 and 255, inclusive. Here, 0 is black, and 255 white. Pixels outside of the image are considered to be white. The color B_pr,pc of a pixel [pr,pc] after blurring can be computed as follows:

∑ Bpr,pc = Hr,c * Ipr+r,pc+c - 3σ≤r,c≤3σ

In words, the new color is a weighted average of the colors of the nearby pixels, and the weights used are the values H_r,c defined above. The exact color B_pr,pc is then rounded to the nearest integer from the set {0,…,255}.

For a true color image, blurring is done separately for its red, green, and blue color component.

For the easy data set we used the value σ = 15, for the hard data set σ = 17.

Input specification

The input is a Portable Network Graphics (PNG) file.

(See http://en.wikipedia.org/wiki/Portable_Network_Graphics if you are not familiar with this image format. All modern browsers and all modern image manipulation programs should be able to open PNG files.)

Output specification

For each input file, submit an output file that contains a single English word in ALL UPPERCASE. If you submit the correct output but not in ALL UPPERCASE, it will be considered incorrect.

The word you are supposed to output is uniquely specified by the hidden part of the input file. For the hard data set, the word has between 6 and 9 letters, and starts with the letter P.

Example

Input image

Original image without the distortion

Output file

BULL

Internet Problem Solving Contest

IPSC 2008

Problem H – Hidden Text