Photometric limits for digital camera systems

Michael Schöberl; André Kaup; Andreas Brückner; Siegfried Fößel

doi:10.1117/1.JEI.21.2.020501

15 June 2012 Photometric limits for digital camera systems

Michael Schöberl, André Kaup, Andreas Brückner, Siegfried Fößel

Author Affiliations +

Journal of Electronic Imaging, Vol. 21, Issue 2, 020501 (June 2012). https://doi.org/10.1117/1.JEI.21.2.020501

Abstract

Image sensors for digital cameras are built with ever decreasing pixel sizes. The size of the pixels seems to be limited by technology only. However, there is also a hard theoretical limit for classical video camera systems: During a certain exposure time only a certain number of photons will reach the sensor. The resulting shot noise thus limits the signal-to-noise ratio. In this letter we show that current sensors are already surprisingly close to this limit.

1. Introduction

The steady progress in semiconductor technology allows the manufacturing of smaller and smaller structures and image sensors with ever shrinking pixel sizes. One can get the impression that the pixel size is just limited by the technology and even smaller pixels are desirable. Today, consumer products with pixel sizes $d_{p} = 1.4 μm$ are already on the market and devices with $d_{p} = 1.1 μm$ are in production.¹ In comparison, photo receptors in the human eye are reported to be larger than 3 μm.²

The general modeling of light is well understood,³ and simulation with commercial tools like ISET⁴ is possible. In contrast, this letter addresses parameters like aperture and pixel size and their photometric consequences for modeling the amount of light that is available in a digital video camera system. One of the design parameters is the resulting image quality. With small pixels only a few photons will hit a single pixel during an exposure period and the signal-to-noise power ratio (SNR) will be poor due to shot noise.⁵ Apart from all technological limitations, this physical boundary limits the performance of today’s video cameras.

2. Image Acquisition Model

The scene radiates a certain amount of light. This is described by an average radiance in object space $L_{obj}$ . The sensor sees an effective amount of light equivalent to the cone with a solid angle $Ω$ as shown in Fig. 1(a). This cone is defined by the sphere of radius equal to the focal length $f$ and a circular aperture disk with diameter $D$ . The solid angle thus calculates to⁶

Eq. (1)

Ω = \frac{π \cdot {(D / 2)}^{2}}{f^{2}} = \frac{π}{4 \cdot {(f / D)}^{2}} [sr] .

The sensor receives an irradiance

I

Eq. (2)

I = η_{lens} \cdot Ω \cdot L_{obj} [W \cdot m^{- 2}],

for a lens with optical transmittance

η_{lens}

.

Fig. 1

Parameters of (a) focal length $f$ , aperture diameter $D$ and resulting solid angle $Ω$ and (b) quadratic pixels with fill factor $γ_{ff}$ .

On the sensor some area is used for interconnects and transistors so that only some of the area is sensitive to light. Figure 1(b) shows pixels of size $d_{p}$ . The ratio of active to total areas is expressed as an effective sensor fill factor $γ_{ff}$ . Even with clever manufacturing like micro-lenses or back side illumination, $γ_{ff} < 1$ holds. A single pixel thus captures a certain amount of radiant power (radiant flux) $Φ_{pix}$ of the sensor irradiance

Eq. (3)

Φ_{pix} = d_{p}^{2} \cdot γ_{ff} \cdot I [W] .

A single photon of wavelength

λ

has the energy

h \frac{c}{λ}

with the speed of light

c

and Planck’s constant

h

. The radiant flux

Φ

thus consists of

N_{phot}

photons

Eq. (4)

N_{phot} = \frac{1}{h \frac{c}{λ}} \cdot τ_{\exp} \cdot Φ_{pix},

during a certain time interval (exposure time) of

τ_{\exp}

. In the photoreceptor only some of these photons are converted into electrons

N_{elec} = η_{qe} \cdot N_{phot}

while others are not, due to reflection, recombination and other material interactions. The conversion rate is expressed as quantum efficiency

η_{qe}

.⁷ The electrons are then collected in the pixel. Although we will see

N_{elec}

, on average, the charge is still quantized and the actual number of electrons is subject to shot noise due to the occurrence of random events. For

N

electrons the associated shot noise is of strength

\sqrt{N}

.⁵ As

N_{elec} \propto Φ_{pix}

, signal power is represented with

N_{elec}

and SNR thus calculates to

Eq. (5)

SNR = N_{elec} / \sqrt{N_{elec}} = \sqrt{N_{elec}} .

In CCD or CMOS technology there are further sources of sensor noise,⁸ which are neglected in the ideal case.

SNR is a parameter that is directly visible in the final images. For answering the original question, we can combine the above equations. This leads to

Eq. (6)

d_{p, \min} = \sqrt{{SNR}^{2} \cdot \frac{h \frac{c}{λ}}{η_{qe} \cdot γ_{ff} \cdot τ_{\exp}} \cdot \frac{4 \cdot {(f / D)}^{2}}{η_{lens} \cdot π \cdot L_{obj}}} .

3. Results for Ideal System

At first we assume ideal technology. A typical indoor scene is illuminated with a luminance of $L_{v} = 100 cd m^{- 2}$ .⁹ For the peak sensitivity of the human eye at a wavelength of $λ = 555 nm$ the SI unit candela is defined¹⁰ as radiant intensity of $1 / 683 W {sr}^{- 1}$ . The radiance in object space is then

Eq. (7)

L_{obj} = 100 \cdot \frac{1}{683} \approx 0.146 W {sr}^{- 1} m^{- 2} .

We further assume a perfectly transparent lens with

η_{lens} = 1

, a wide aperture

f / D = 2.8

, fill factor

γ_{ff} = 1

and quantum efficiency

η_{qe} = 1

. For achieving typical video frame rates a maximum exposure time of

τ_{\exp} = 0.03 s

is used. For a human observer, images without visible noise are preferred. From psychophysical studies a thousand-photon limit is reported as the threshold for visibility of shot noise.⁵ We therefore set

SNR = \sqrt{1000} \approx 32

. With green light with

λ = 555 nm

the minimum pixel size calculates to

d_{p, \min} = 0.9 μm

.

The influence of different apertures is shown in Fig. 2. With larger aperture diameters, even smaller pixels can be used. A variation of luminance is also possible: In practice, the human color perception (photoptic vision) starts at $L_{v} = 3 cd m^{- 2}$ .⁹ The luminance in daylight exterior scenarios is typically $L_{v} = 10^{4} cd m^{- 2}$ .⁹ The resulting minimum pixel sizes thus range from 5 to 0.09 μm as shown in Fig. 3.

Fig. 2

Minimum pixel sizes for photon limited system with varying apertures, ideal system with $SNR = \sqrt{1000}$ and scene with luminance of $L_{v} = 100 cd m^{- 2}$ , dashed line for $f / D = 2.8$ .

Fig. 3

Minimum pixel sizes for photon limited system with varying luminance, ideal system with $SNR = \sqrt{1000}$ and aperture $f / D = 2.8$ , dashed line for $L_{v} = 100 cd m^{- 2}$ .

4. Radiometric Modeling

Up to now, we used monochromatic light only. We now extend this and also include the spectral distribution of light. Again, we start with a scene with a luminance of $L_{v} = 100 cd m^{- 2}$ . Now, the light is made up of radiation from a light bulb. This is modeled as a black body at a certain color temperature $T$ and a spectral radiance of

Eq. (8)

L_{obj, λ} (λ, T) = L_{0} \cdot \frac{2 h c^{2}}{λ^{5}} \cdot \frac{1}{e^{\frac{h c}{λ k T}} - 1} [W \cdot {sr}^{- 1} \cdot m^{- 3}] .

With the photoptic luminous efficiency function¹¹

V_{m}

, we set

Eq. (9)

K_{m} \cdot \int_{0}^{\infty} V_{m} \cdot (λ) \cdot L_{obj, λ} (λ, T) d λ \overset{!}{=} L_{v},

with

K_{m} = 683 lm W^{- 1}

. The resulting normalized spectral radiance

L_{obj, λ} (λ, T)

is now perceived by the human eye as a luminance of

L_{v} = 100 cd m^{- 2}

. Figure 4 shows the resulting set of normalized spectral radiances for typical color temperatures.

Fig. 4

Spectral radiance of black bodies with temperatures $T$ , intensity scaled to be perceived as luminance of $100 cd m^{- 2}$ .

Today, most cameras are used to capture scenes for later viewing by a human. The camera should therefore create a representation of the scene that is similar to that of the human visual system. We simulate an ideal camera with the spectral sensitivity curves based on the Stockman and Sharpe cone measurements of the human eye.¹² The corresponding spectral sensitivity functions for long (L), medium (M) and short (S) wavelengths are shown in Fig. 5. However, we assume an ideal camera with ideal color filters and material without any attenuation ( $η_{qe} = 1$ ) at peak efficiency.

Fig. 5

Sensitivity functions of 10-deg cone fundamentals for $L$ , $M$ and $S$ cone and luminous efficiency function $V_{m}$ .

In Table 1, the resulting minimum pixel sizes are shown for the radiometric simulation. The luminosity case with monochromatic light at $λ = 555 nm$ corresponds to the ideal simulation from above. There is less than 10% error for the simulation with $L$ and $M$ cones compared to the luminosity. This is plausible from the high similarity of the respective sensitivity curves. However, the capturing of blue light (short wavelengths with cone $S$ ) requires larger pixels. At short wavelengths, the individual photons have a higher energy and thus, there are fewer for a given radiant flux. This explains the problem of inferior performance of blue color channels in typical digital cameras. The extreme case of observing monochromatic green light with a short wavelength sensitivity leads to even fewer photons and would require pixels with 26 μm. In general, the monochromatic calculation is only slightly optimistic but gives a good approximation to a radiometric computation.

Table 1

Minimum pixel sizes (in μm) based on radiometric calculations for light sources with black body radiation of temperature T and monochromatic light source.

Light source	Cone $L$	Cone $M$	Cone $S$	Luminosity
$T = 3200 K$	0.86	1.04	2.13	0.89
$T = 4500 K$	0.89	1.02	1.64	0.90
$T = 5600 K$	0.90	1.01	1.45	0.90
$T = 6400 K$	0.91	1.01	1.36	0.90
$λ = 555 nm$	0.92	0.92	26.00	0.90

5. Results with Current Technology

The above numbers represent the theoretical limit for ideal sensors. In practice, a real world camera does not achieve these numbers. For example, a highly optimized three layer stacked image sensor is reported by Hannebauer et al.¹³ For pixels of size $d_{p} = 4.8 μm$ a high fill factor of $γ_{ff} = 0.95$ and quantum efficiency of $η_{qe} = 0.8$ is possible with many (costly) optimizations. In current 1.4 μm consumer grade sensors the backside illumination (BSI) technology enables close to 100% fill factor.¹⁴ For color imaging, the spectral sensitivity is not without attenuation and peak quantum efficiencies of about $η_{qe} \approx 0.5$ are reported by OmniVision¹⁴ and Aptina.¹⁵ In scientific CMOS sensors, the combined sensor readout noise is reported as low as $1.3 electrons / pixel$ ¹⁶ and can thus be neglected among 1000 electrons. The combined assumption of $η_{lens} = 0.95$ , $γ_{ff} = 0.95$ and $η_{qe} = 0.5$ leads to a minimum pixel size of $d_{p, \min} = 1.34 μm$ . With mass-market sensors and additional noise,⁸ larger pixels are required.

These small pixels also reach another technological limit of decreasing full well capacity. For example Aptina reports¹⁵ $C = 5000$ electrons, which leaves only a dynamic range of $5 ∶ 1$ from noise visibility⁵ to overexposure. As a result, most of the image will still look noisy. However, this is a technological challenge that could be addressed with multiple readouts during the exposure.¹⁷

Another limitation comes with optical diffraction. Even in ideal optics the achievable resolution of a camera system is limited. The Sparrow criterion suggests³ that there is no gain in resolution below a critical pixel size of $d_{p, crit} = \frac{λ}{2} \cdot f / D$ . For our example of $f / D = 2.8$ and $λ = 555 nm$ , we obtain $d_{p, crit} = 0.78 μm$ . Achieving this limit, however, is challenging, especially in the off-axis field, and leads to expensive optics. A further decrease in aperture requires a dramatic increase of the technological efforts and smaller tolerances for optics manufacturers.

6. Conclusion

In our photometric analysis, we discuss the number of photons per pixel. With small pixels the image quality is limited by shot noise, and for indoor scenarios the current video cameras are surprisingly close to this fundamental limit. We estimate that even with ideal technology, a pixel size below $d_{p} = 0.9 μm$ will not capture enough light to generate visually pleasing videos any more. Current technology is far from perfect and with optimistic assumptions, the limit at $d_{p} = 1.34 μm$ is close to current sensors. However, for other imaging scenarios like outdoor daylight still photography, there is plenty of room at the bottom.

References

1.

R. Fontaine, “A review of the 1.4 um pixel generation,” in Int. Image Sensor Workshop (IISW), (2011). Google Scholar

2.

J. B. JonasU. SchneiderG. O. H. Naumann, “Count and density of human retinal photoreceptors,” Graefes Arch. Clin. Exp. Ophthalmol., 230 505 –510 (1992). http://dx.doi.org/10.1007/BF00181769 GACODL 0721-832X Google Scholar

3.

J. Goodman, Introduction to Fourier Optics, Roberts & Company Publishers, Englewood, Colorado, USA (2005). Google Scholar

4.

J. Farrellet al., “A simulation tool for evaluating digital camera image quality,” SPIE Electron. Imag.—Image Quality and System Performance, 5294 124 –131 (2004). http://dx.doi.org/10.1117/12.537474 Google Scholar

5.

F. XiaoJ. FarrellB. Wandell, “Psychophysical thresholds and digital camera sensitivity: the thousand photon limit,” SPIE Electron. Imag.—Digital Photography, 5678 75 –84 (2005). http://dx.doi.org/10.1117/12.587468 Google Scholar

6.

R. Kingslake, Optical System Design, 1 Academic Press, London (1983). Google Scholar

7.

B. Fowleret al., “A method for estimating quantum efficiency for CMOS image sensors,” SPIE Electron. Imag.—Solid State Sensor Arrays: Development and Applications II, 3301 178 –185 (1998). http://dx.doi.org/10.1117/12.304561 Google Scholar

8.

R. Gowet al., “A comprehensive tool for modeling CMOS image-sensor-noise performance,” IEEE Trans. Electron. Devices, 54 1321 –1329 (2007). http://dx.doi.org/10.1109/TED.2007.896718 IETDAI 0018-9383 Google Scholar

9.

W. Smith, Modern Optical Engineering: The Design of Optical Systems, Tata McGraw-Hill Education, Englewood, Colorado, USA (1990). Google Scholar

10.

P. Giacomo, “News from the BIPM: resolution 3—definition of the candela,” Metrologia, 16 (1), 55 –61 (1980). http://dx.doi.org/10.1088/0026-1394/16/1/008 MTRGAU 0026-1394 Google Scholar

11.

L. Sharpeet al., “A luminous efficiency function,

V * (λ)

, for daylight adaptation,” J. Vision, 5 (11), 948 –968 (2005). http://dx.doi.org/10.1167/5.11.3 1534-7362 Google Scholar

12.

A. StockmanL. Sharpe, “The spectral sensitivities of the middle-and long-wavelength-sensitive cones derived from measurements in observers of known genotype,” Vis. Res., 40 (13), 1711 –1737 (2000). http://dx.doi.org/10.1016/S0042-6989(00)00021-3 VISRAM 0042-6989 Google Scholar

13.

R. Hannebaueret al., “Optimizing quantum efficiency in a stacked CMOS sensor,” SPIE Electron. Imag.—Sensors, Cameras, and Systems for Industrial, Scientific, and Consumer Applications XII, 7875 (1), 787505 (2011). http://dx.doi.org/10.1117/12.873610 Google Scholar

14.

H. Rhodeset al., “The mass production of second generation 65 nm BSI CMOS image sensors,” in Int. Image Sensor Workshop (IISW), (2011). Google Scholar

15.

G. Agranovet al., “Pixel continues to shrink … pixel development for novel CMOS image sensors: a review of the 1.4 um pixel generation,” in Int. Image Sensor Workshop (IISW), (2011). Google Scholar

16.

B. Fowleret al., “A 5.5 mpixel

100 frames / \sec

wide dynamic range low noise CMOS image sensor for scientific applications,” SPIE Electron. Imag.—Sensors, Cameras, and Systems for Industrial/Scientific Applications XI, 7536 753607 (2010). http://dx.doi.org/10.1117/12.846975 Google Scholar

17.

M. Schöberlet al., “Digital neutral density filter for moving picture cameras,” SPIE Electron. Imag.—Computational Imag. VIII, 7533 75330L (2010). http://dx.doi.org/10.1117/12.838833 Google Scholar

Citation Download Citation

Michael Schöberl, André Kaup, Andreas Brückner, and Siegfried Fößel "Photometric limits for digital camera systems," Journal of Electronic Imaging 21(2), 020501 (15 June 2012). https://doi.org/10.1117/1.JEI.21.2.020501

Published: 15 June 2012

Access the abstract

JOURNAL ARTICLE
4 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

CITATIONS

Cited by 28 scholarly publications.

Explore citations on Lens.org

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Signal to noise ratio

Sensors

Quantum efficiency

Cameras

Photons

Digital cameras

Imaging systems

1.

Introduction

2.

Image Acquisition Model

Eq. (1)

Eq. (2)

Fig. 1

Eq. (3)

Eq. (4)

Eq. (5)

Eq. (6)

3.

Results for Ideal System

Eq. (7)

Fig. 2

Fig. 3

4.

Radiometric Modeling

Eq. (8)

Eq. (9)

Fig. 4

Fig. 5

Table 1

5.

Results with Current Technology

6.

Conclusion

References

Show All Keywords

Keywords/Phrases

Search In:

Publication Years