Xuemei Zhang, Yuli Gao, C. Brian Atkins, Phil Cheatle, Jun Xiao, Hui Chao, Peng Wu, Daniel Tretter, David Slatter, Andrew Carter, Roland Penny, Chris Willis
The design of a computer-assisted photobook authoring solution continues to be a challenging task, since consumers
want four things from such an application: simplicity, quality, customizability and speed. Our AutoPhotobook solution
uses technology to enable a system that preserves all four characteristics, providing high quality custom photobooks
while keeping complexity and authoring time modest. We leverage both design knowledge and image understanding
algorithms to automate time-consuming tasks like image selection, grouping, cropping and layout. This streamlines the
initial creation phase, so the user is never stuck staring at a blank page wondering where to begin. Our composition
engine then allows users to easily edit the book: adding, swapping or moving objects, exploring different page layouts
and themes, and even dynamically adjusting the aspect ratio of the final book. Our technologies enable even novice
users to easily create aesthetically pleasing photobooks that tell their underlying stories. AutoPhotobook provides
advances over prior solutions in the following areas: automatic image selection and theme-based image grouping;
dynamic page layout including text support; automatic cropping; design-preserving background artwork transformation;
and a simple yet powerful user interface for personalization. In this paper, we present these technologies and illustrate
how they work together to improve the photobook authoring process.
Selecting cosmetics requires visual information and often benefits from the assessments of a cosmetics expert. In this
paper we present a unique mobile imaging application that enables women to use their cell phones to get immediate
expert advice when selecting personal cosmetic products. We derive the visual information from analysis of camera
phone images, and provide the judgment of the cosmetics specialist through use of an expert system. The result is a new
paradigm for mobile interactions-image-based information services exploiting the ubiquity of camera phones. The
application is designed to work with any handset over any cellular carrier using commonly available MMS and SMS
features. Targeted at the unsophisticated consumer, it must be quick and easy to use, not requiring download capabilities
or preplanning. Thus, all application processing occurs in the back-end system and not on the handset itself. We present
the imaging pipeline technology and a comparison of the services' accuracy with respect to human experts.
KEYWORDS: Video, Medical imaging, Cameras, Projection systems, Imaging systems, Digital imaging, Digital video recorders, Diagnostics, Telecommunications, 3D image processing
We have developed the CMAS system (Collaborative Medical Annotation System)
so that medical professionals will be able to easily annotate
digital medical records that contain medical imaging or procedure videos.
The CMAS system enables a non-technical person to annotate
a medical image or video with their recorded presence.
The CMAS system displays medical images via a
projector onto a screen; when a doctor (or patient)
physically walks in front of this screen with the medical
image and gives his/her opinion while gesturing at the image,
the CMAS system intuitively captures this interaction
by creating a video annotation with HP's Active Shadows technology.
The CMAS system automatically transforms physical
interactions, ranging from a laser pointer spot to a doctor's physical
presence, into video annotation that then can be overlaid on
top of the medical image or seamlessly inserted into the procedure video.
Annotated in such a manner, the medical record
retains the historical development of the diagnostic medical opinion,
explained through presence of doctors and their respective annotations.
The CMAS system structures the annotation of digital medical records such
that image/video annotations from multiple sources, at
different times, and from different locations can be maintained
within a historical context and be consistently referenced among multiple
annotations.
To run a targeted campaign involves coordination and management across numerous organizations and complex process flows. Everything from market analytics on customer databases, acquiring content and images, composing the materials, meeting the sponsoring enterprise brand standards, driving through production and fulfillment, and evaluating results; all processes are currently performed by experienced highly trained staff. Presented is a developed solution that not only brings together technologies that automate each process, but also automates the entire flow so that a novice user could easily run a successful campaign from their desktop. This paper presents the technologies, structure, and process flows used to bring this system together. Highlighted will be how the complexity of running a targeted campaign is hidden from the user through technologies, all while providing the benefits of a professionally managed campaign.
This paper introduces a novel Active Document Versioning system that can extract the layout template and constraints from the original document and then automatically adjust the layout to accommodate new contents. "Active" reflects several unique features of the system: First, the need of handcrafting adjustable templates is largely eliminated through layout understanding techniques that can convert static documents into Active Layout Templates and accompanying
constraints. Second, through the linear text block modeling and the two-pass constraint solving algorithm, it supports a rich set of layout operations, such as simultaneous optimization of text block width and height, integrated image cropping, and non-rectangular text wrapping. This system has been successfully applied to a wide range of professionally designed documents. This paper covers both the core algorithms and the implementation.
PDF is a document format for final presentation. It preserves the original document layout but often not the document logical structure. Graphic illustrations such as figures and tables in PDF often consist of ungrouped graphic primitives such as lines, curves and small text elements. In this paper, we present a bottom up approach to recognize graphic illustration in PDF document. Vicinities of page elements in both 2D space and indexes in layer are used to understand the logical connection between elements. Graphics recognition and elements grouping for illustration is an important part in understanding the document logical structure. This technique can be used in automatic figure extraction, document re-flow and document transformation.
KEYWORDS: Computing systems, Digital signal processing, Functional magnetic resonance imaging, Image processing, Magnetic resonance imaging, Embedded systems, Image restoration, Data acquisition, Mercury, Data storage
Due to the dynamic nature of brain studies in functional magnetic resonance imaging (fMRI), fast pulse sequences such as echo planar imaging (EPI) and spiral are often used for higher temporal resolution. Hundreds of frames of two- dimensional (2-D) images or multiple three-dimensional (3-D) images are often acquired to cover a larger space and time range. Therefore, fMRI often requires a much larger data storage, faster data transfer rate and higher processing power than conventional MRI. In Mercury Computer Systems' PCI-based embedded computer system, the computer architecture allows the concurrent use of a DMA engine for data transfer and CPU for data processing. This architecture allows a multicomputer to distribute processing and data with minimal time spent transferring data. Different types and numbers of processors are available to optimize system performance for the application. The fMRI reconstruction was first implemented in Mercury's PCI-based embedded computer system by using one digital signal processing (DSP) chip, with the host computer running under the Windows NTR platform. Double buffers in SRAM or cache were created for concurrent I/O and processing. The fMRI reconstruction was then implemented in parallel using multiple DSP chips. Data transfer and interprocessor synchronization were carefully managed to optimize algorithm efficiency. The image reconstruction times were measured with different numbers of processors ranging from one to 10. With one DSP chip, the timing for reconstructing 100 fMRI images measuring 128 X 64 pixels was 1.24 seconds, which is already faster than most existing commercial MRI systems. This PCI-based embedded multicomputer architecture, which has a nearly linear improvement in performance, provides high performance for fMRI processing. In summary, this embedded multicomputer system allows the choice of computer topologies to fit the specific application to achieve maximum system performance.
Digital vascular computer systems are used for radiology and fluoroscopy (R/F), angiography, and cardiac applications. In the United States alone, about 26 million procedures of these types are performed annually: about 81% R/F, 11% cardiac, and 8% angiography. Digital vascular systems have a very wide range of performance requirements, especially in terms of data rates. In addition, new features are added over time as they are shown to be clinically efficacious. Application-specific processing modes such as roadmapping, peak opacification, and bolus chasing are particular to some vascular systems. New algorithms continue to be developed and proven, such as Cox and deJager's precise registration methods for masks and live images in digital subtraction angiography. A computer architecture must have high scalability and reconfigurability to meet the needs of this modality. Ideally, the architecture could also serve as the basis for a nonvascular R/F system.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.