Publication statistics

Pub. period: 1983-2007
Pub. count: 43
Number of co-authors: 42



Co-authors

Number of publications with 3 favourite co-authors:

Michael G. Christel: 8
Rong Yan: 8
Rong Jin: 6

 

 

Productive colleagues

Alexander G. Hauptmann's 3 most productive colleagues in number of publications:

Jamie Callan: 47
Sidney Fels: 36
Luo Si: 32
 
 
 


Alexander G. Hauptmann

Has also published under the name of:
"Alex G. Hauptmann", "A. G. Hauptmann", and "Alexander Hauptmann"

Personal Homepage:
http://www.cs.cmu.edu/~alex/


Publications by Alexander G. Hauptmann (bibliography)

2007
 
Yan, Rong and Hauptmann, Alexander G. (2007): Query expansion using probabilistic local feedback with application to multimedia retrieval. In: Silva, Mario J., Laender, Alberto H. F., Baeza-Yates, Ricardo A., McGuinness, Deborah L., Olstad, Bjørn, Olsen, Øystein Haug and Falcão, André O. (eds.) Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management - CIKM 2007 November 6-10, 2007, Lisbon, Portugal. pp. 361-370.

 
Wu, Xiao, Hauptmann, Alexander G. and Ngo, Chong-Wah (2007): Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts. In: Lienhart, Rainer, Prasad, Anand R., Hanjalic, Alan, Choi, Sunghyun, Bailey, Brian P. and Sebe, Nicu (eds.) Proceedings of the 15th International Conference on Multimedia 2007 September 24-29, 2007, Augsburg, Germany. pp. 168-177.

 
Yang, Jun, Yan, Rong and Hauptmann, Alexander G. (2007): Cross-domain video concept detection using adaptive SVMs. In: Lienhart, Rainer, Prasad, Anand R., Hanjalic, Alan, Choi, Sunghyun, Bailey, Brian P. and Sebe, Nicu (eds.) Proceedings of the 15th International Conference on Multimedia 2007 September 24-29, 2007, Augsburg, Germany. pp. 188-197.

 
Wu, Xiao, Hauptmann, Alexander G. and Ngo, Chong-Wah (2007): Practical elimination of near-duplicates from web video search. In: Lienhart, Rainer, Prasad, Anand R., Hanjalic, Alan, Choi, Sunghyun, Bailey, Brian P. and Sebe, Nicu (eds.) Proceedings of the 15th International Conference on Multimedia 2007 September 24-29, 2007, Augsburg, Germany. pp. 218-227.

2006
 
Yan, Rong and Hauptmann, Alexander G. (2006): Probabilistic latent query analysis for combining multiple retrieval sources. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2006. pp. 324-331.

Combining the output from multiple retrieval sources over the same document collection is of great importance to a number of retrieval tasks such as multimedia retrieval, web retrieval and meta-search. To merge retrieval sources adaptively according to query topics, we propose a series of new approaches called probabilistic latent query analysis (pLQA), which can associate non-identical combination weights with latent classes underlying the query space. Compared with previous query independent and query-class based combination methods, the proposed approaches have the advantage of being able to discover latent query classes automatically without using prior human knowledge, to assign one query to a mixture of query classes, and to determine the number of query classes under a model selection principle. Experimental results on two retrieval tasks, i.e., multimedia retrieval and meta-search, demonstrate that the proposed methods can uncover sensible latent classes from training data, and can achieve considerable performance gains.

© All rights reserved Yan and Hauptmann and/or ACM Press

 
Hauptmann, Alexander G., Lin, Wei-Hao, Yan, Rong, Yang, Jun and Chen, Ming-yu (2006): Extreme video retrieval: joint maximization of human and computer performance. In: Nahrstedt, Klara, Turk, Matthew, Rui, Yong, Klas, Wolfgang and Mayer-Patel, Ketan (eds.) Proceedings of the 14th ACM International Conference on Multimedia October 23-27, 2006, Santa Barbara, CA, USA. pp. 385-394.

 
Yang, Jun and Hauptmann, Alexander G. (2006): 3WNews: who, where, and when in news video. In: Nahrstedt, Klara, Turk, Matthew, Rui, Yong, Klas, Wolfgang and Mayer-Patel, Ketan (eds.) Proceedings of the 14th ACM International Conference on Multimedia October 23-27, 2006, Santa Barbara, CA, USA. pp. 503-504.

2005
 
Lin, Wei-Hao and Hauptmann, Alexander G. (2005): Revisiting the effect of topic set size on retrieval error. In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2005. pp. 637-638.

 
Chen, Ming-yu, Christel, Michael G., Hauptmann, Alexander G. and Wactlar, Howard D. (2005): Putting active learning into multimedia applications: dynamic definition and refinement of concept classifiers. In: Zhang, Hongjiang, Chua, Tat-Seng, Steinmetz, Ralf, Kankanhalli, Mohan S. and Wilcox, Lynn (eds.) Proceedings of the 13th ACM International Conference on Multimedia November 6-11, 2005, Singapore. pp. 902-911.

 
Yang, Jun, Yan, Rong and Hauptmann, Alexander G. (2005): Multiple instance learning for labeling faces in broadcasting news video. In: Zhang, Hongjiang, Chua, Tat-Seng, Steinmetz, Ralf, Kankanhalli, Mohan S. and Wilcox, Lynn (eds.) Proceedings of the 13th ACM International Conference on Multimedia November 6-11, 2005, Singapore. pp. 31-40.

2004
 
Chen, Ming-yu and Hauptmann, Alexander G. (2004): Multi-modal classification in digital news libraries. In: JCDL04: Proceedings of the 4th ACM/IEEE-CS Joint Conference on Digital Libraries 2004. pp. 212-213.

This paper describes a comprehensive approach to construct robust multi-modal video classification on a specific digital source, broadcast news. Broadcast news has a very stable structure and every segment has its specific purpose. Video classification can support fundamental understanding of the structure of the video and the content. The variety of video content makes it hard to classify; however, it also provides multimodal information. Our approach tries to solve two important issues of multimodal classification. The first one is to select a few discriminative features from many raw features and the second one is to efficiently combine multiple sources. We applied Fisher's Linear Discriminant (FLD) for feature selection and concatenated the projections into a single synthesized feature vector as the combination strategy. Experimental results on the 2003 TRECVID news video archive show that our approach achieves very robust and accurate performance.

© All rights reserved Chen and Hauptmann and/or ACM Press

 
Hauptmann, Alexander G. and Christel, Michael G. (2004): Successful approaches in the TREC video retrieval evaluations. In: Schulzrinne, Henning, Dimitrova, Nevenka, Sasse, Martina Angela, Moon, Sue B. and Lienhart, Rainer (eds.) Proceedings of the 12th ACM International Conference on Multimedia October 10-16, 2004, New York, NY, USA. pp. 668-675.

 
Yan, Rong, Yang, Jun and Hauptmann, Alexander G. (2004): Learning query-class dependent weights in automatic video retrieval. In: Schulzrinne, Henning, Dimitrova, Nevenka, Sasse, Martina Angela, Moon, Sue B. and Lienhart, Rainer (eds.) Proceedings of the 12th ACM International Conference on Multimedia October 10-16, 2004, New York, NY, USA. pp. 548-555.

 
Yang, Jun and Hauptmann, Alexander G. (2004): Naming every individual in news video monologues. In: Schulzrinne, Henning, Dimitrova, Nevenka, Sasse, Martina Angela, Moon, Sue B. and Lienhart, Rainer (eds.) Proceedings of the 12th ACM International Conference on Multimedia October 10-16, 2004, New York, NY, USA. pp. 580-587.

2003
 
Lin, Wei-Hao, Jin, Rong and Hauptmann, Alexander G. (2003): Web Image Retrieval Re-Ranking with Relevance Model. In: 2003 IEEE / WIC International Conference on Web Intelligence - WI 2003 13-17 October, 2003, Halifax, Canada. pp. 242-248.

 
Yan, Rong and Hauptmann, Alexander G. (2003): The combination limit in multimedia retrieval. In: Rowe, Lawrence A., Vin, Harrick M., Plagemann, Thomas, Shenoy, Prashant J. and Smith, John R. (eds.) Proceedings of the Eleventh ACM International Conference on Multimedia November 2-8, 2003, Berkeley, CA, USA. pp. 339-342.

 
Yan, Rong, Hauptmann, Alexander G. and Jin, Rong (2003): Negative pseudo-relevance feedback in content-based video retrieval. In: Rowe, Lawrence A., Vin, Harrick M., Plagemann, Thomas, Shenoy, Prashant J. and Smith, John R. (eds.) Proceedings of the Eleventh ACM International Conference on Multimedia November 2-8, 2003, Berkeley, CA, USA. pp. 343-346.

 
Derthick, Mark, Christel, Michael G., Hauptmann, Alexander G. and Wactlar, Howard D. (2003): Constant Density Displays Using Diversity Sampling. In: InfoVis 2003 - 9th IEEE Symposium on Information Visualization 20-21 October, 2003, Seattle, WA, USA.

2002
 
Hauptmann, Alexander G. and Papernick, Norman D. (2002): Video-cuebik: adapting image search to video shots. In: JCDL02: Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries 2002. pp. 156-157.

We propose a new analysis for searching images in video libraries that goes beyond simple image search, which compares one still image frame to another. The key idea is to expand the definition of an image to account for the variability in the sequence of video frames that comprise a shot. A first implementation of this method for a QBIC-like image search engine shows a clear improvement over still image search. A combination of the traditional still image search and the new video image search provided the overall best results on the TREC video retrieval evaluation data.

© All rights reserved Hauptmann and Papernick and/or ACM Press

 
Hauptmann, Alexander G., Jin, Rong and Ng, Tobun Dorbin (2002): Multi-modal information retrieval from broadcast video using OCR and speech recognition. In: JCDL02: Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries 2002. pp. 160-161.

We examine multi-modal information retrieval from broadcast video where text can be read on the screen through OCR and speech recognition can be performed on the audio track. OCR and speech recognition are compared on the 2001 TREC Video Retrieval evaluation corpus. Results show that OCR is more important than speech recognition for video retrieval. OCR retrieval can further improve through dictionary-based post-processing. We demonstrate how to utilize imperfect multi-modal metadata results to benefit multi-modal information retrieval.

© All rights reserved Hauptmann et al. and/or ACM Press

 
Hauptmann, Alexander G., Christel, Michael G. and Papernick, Norman D. (2002): Video retrieval with multiple image search strategies. In: JCDL02: Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries 2002. p. 376.

 
Jin, Rong, Hauptmann, Alexander G. and Zhai, Cheng Xiang (2002): Title language model for information retrieval. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2002. pp. 42-48.

In this paper, we propose a new language model, namely, a title language model, for information retrieval. Different from the traditional language model used for retrieval, we define the conditional probability P(Q|D) as the probability of using query Q as the title for document D. We adopted the statistical translation model learned from the title and document pairs in the collection to compute the probability P(Q|D). To avoid the sparse data problem, we propose two new smoothing methods. In the experiments with four different TREC document collections, the title language model for information retrieval with the new smoothing method outperforms both the traditional language model and the vector space model for IR significantly.

© All rights reserved Jin et al. and/or ACM Press

 
Jin, Rong, Si, Luo, Hauptmann, Alexander G. and Callan, Jamie (2002): Language model for IR using collection information. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2002. pp. 419-420.

Information retrieval using meta data can be traced back to the early age of IR where documents are represented by the controlled vocabulary. In this paper, we explore the usage of meta-data information under the framework of language model. We present a new language model that is able to take advantage of the category information for documents to improve the retrieval accuracy. We compare the new language model with the traditional language model over the TREC4 dataset where the collection information for documents is obtained using the k-means clustering method. The new language model outperforms the traditional language model, which verifies our statement.

© All rights reserved Jin et al. and/or ACM Press

 
Christel, Michael G., Hauptmann, Alexander G., Wactlar, Howard D. and Ng, Tobun D. (2002): Collages as dynamic summaries for news video. In: ACM Multimedia 2002. pp. 561-569.

 
Lin, Wei-Hao and Hauptmann, Alexander G. (2002): News video classification using SVM-based multimodal classifiers and combination strategies. In: ACM Multimedia 2002. pp. 323-326.

2001
 
Palmer, C. R., Pesenti, J., Valdes-Perez, R. E., Christel, Michael G., Hauptmann, Alexander G., Ng, D. and Wactlar, H. D. (2001): Demonstration of Hierarchical Document Clustering of Digital Library Retrieval Results. In: JCDL01: Proceedings of the 1st ACM/IEEE-CS Joint Conference on Digital Libraries 2001. p. 451.

As digital libraries grow in size, querying their contents will become as frustrating as querying the web is now. One remedy is to hierarchically cluster the results that are returned by searching a digital library. We demonstrate the clustering of search results from Carnegie Mellon's Informedia database, a large video library that supports indexing and retrieval with automatically generated descriptors.

© All rights reserved Palmer et al. and/or ACM Press

 
Jin, Rong, Faloutsos, Christos and Hauptmann, Alexander G. (2001): Meta-scoring: automatically evaluating term weighting schemes in IR without precision-recall. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2001. pp. 83-89.

In this paper, we present a method that can automatically evaluate the performance of different term weighting schemes in information retrieval without resorting to precision-recall based on human relevance judgments. Specifically, the problem is: given two document-term matrices generated from two different term weighting schemes, can we tell which term weighting scheme will perform better than the other? We propose a meta-scoring function, which takes as input the document-term matrix generated by some term weighting scheme and computes a goodness score from the document-term matrix. In our experiments, we found that this score is highly correlated with the precision-recall measurement for all the collections and term weighting schemes we tried. Thus, we conclude that our meta-scoring function can be a substitute for the precision-recall measurement, which needs relevance judgments from human subjects. Furthermore, this meta-scoring function is not limited to text information retrieval; it can also be applied to fields such as image and DNA retrieval.

© All rights reserved Jin et al. and/or ACM Press

2000
 
Kennedy, Paul E. and Hauptmann, Alexander G. (2000): Automatic Title Generation for EM. In: DL00: Proceedings of the 5th ACM International Conference on Digital Libraries 2000. pp. 230-231.

Our prototype automatic title generation system inspired by statistical machine-translation approaches [1] treats the document title like a translation of the document. Titles can be generated without extracting words from the document. A large corpus of documents with human-assigned titles is required for training title "translation" models. On an f1 evaluation score our approach outperformed another approach based on Bayesian probability estimates [7].

© All rights reserved Kennedy and Hauptmann and/or ACM Press

 
Wactlar, Howard D., Hauptmann, Alexander G., Christel, Michael G., Houghton, Ricky and Olligschlaeger, Andreas M. (2000): Complementary Video and Audio Analysis for Broadcast News Archives. In Communications of the ACM, 43 (2) pp. 42-47.

1999
 
Wactlar, Howard D., Christel, Michael G., Gong, Yihong and Hauptmann, Alexander G. (1999): Lessons Learned from Building a Terabyte Digital Video Library. In IEEE Computer, 32 (2) pp. 66-73.

1998
 
Hauptmann, Alexander G. and Lee, Danny (1998): Topic Labeling of Broadcast News Stories in the Informedia Digital Video Library. In: DL98: Proceedings of the 3rd ACM International Conference on Digital Libraries 1998. pp. 287-288.

This paper describes the implementation of a topic labeling component for the Informedia Digital Video Library. Each news story recorded from the evening news is assigned to one of 3178 topic categories using a K-nearest neighbor classification algorithm. In preliminary tests, the system achieved recall of 0.491 with relevance of 0.482 when up to 5 topics could be assigned to a news story.

© All rights reserved Hauptmann and Lee and/or ACM Press

 
Witbrock, Michael J. and Hauptmann, Alexander G. (1998): Speech Recognition for a Digital Video Library. In JASIST - Journal of the American Society for Information Science and Technology, 49 (7) pp. 619-632.

1997
 
Witbrock, Michael J. and Hauptmann, Alexander G. (1997): Using Words and Phonetic Strings for Efficient Information Retrieval from Imperfectly Transcribed Spoken Documents. In: DL97: Proceedings of the 2nd ACM International Conference on Digital Libraries 1997. pp. 30-35.

Searching for relevant material in documents containing transcription errors presents new challenges for Information Retrieval. This paper examines information retrieval effectiveness on a corpus of spoken broadcast news documents. For documents transcribed using speech recognition, a substantial number of retrieval errors are due to query terms that occur in the spoken document, but are not transcribed because they are not within the speech recognition system's lexicon, even if that lexicon contains twenty thousand words. It has been shown that a phonetic lattice search in conjunction with full word search regains some of the information lost due to out-of-vocabulary words. In this paper an efficient alternative to this search is proposed that does not require a complete search of the phoneme lattices for all documents at run-time. By using fixed length strings of phonemes instead of phonetic lattices, an information retrieval system can search the phoneme space of a spoken document just as efficiently as a normal word document collection. Experimental evidence is presented that this technique permits the system to recapture some of the information lost due to out-of-vocabulary words in the speech recognition transcripts.

© All rights reserved Witbrock and Hauptmann and/or ACM Press

1995
 
Mostow, Jack, Hauptmann, Alexander G. and Roth, Steven F. (1995): Demonstration of a Reading Coach that Listens. In: Robertson, George G. (ed.) Proceedings of the 8th annual ACM symposium on User interface and software technology November 15 - 17, 1995, Pittsburgh, Pennsylvania, United States. pp. 77-78.

Project LISTEN stands for "Literacy Innovation that Speech Technology ENables." We will demonstrate a prototype automated reading coach that displays text on a screen, listens to a child read it aloud, and helps where needed. We have tested successive prototypes of the coach on several dozen second graders. [1] reports implementation details and evaluation results. Here we summarize its functionality, the issues it raises in human-computer interaction, and how it addresses them. We are redesigning the coach based on our experience, and will demonstrate its successor at UIST '95.

© All rights reserved Mostow et al. and/or ACM Press

 
Hauptmann, Alexander G., Witbrock, Michael J., Rudnicky, Alexander I. and Reed, Stephen (1995): Speech for Multimedia Information Retrieval. In: Robertson, George G. (ed.) Proceedings of the 8th annual ACM symposium on User interface and software technology November 15 - 17, 1995, Pittsburgh, Pennsylvania, United States. pp. 79-80.

We describe the Informedia News-on-Demand system. News-on-Demand is an innovative example of indexing and searching broadcast video and audio material by text content. The fully automatic system monitors TV news and allows selective retrieval of news items based on spoken queries. The user then plays the appropriate video "paragraph". The system runs on a Pentium PC using MPEG-I video compression and the Sphinx-II continuous speech recognition system [6].

© All rights reserved Hauptmann et al. and/or ACM Press

1994
 
Rudnicky, Alexander I., Hauptmann, Alexander G. and Lee, Kai-Fu (1994): Survey of Current Speech Technology. In Communications of the ACM, 37 (3) pp. 52-57.

1993
 
Hauptmann, Alexander G. and McAvinney, Paul (1993): Gestures with Speech for Graphic Manipulation. In International Journal of Man-Machine Studies, 38 (2) pp. 231-249.

This paper reports on the utility of gestures and speech to manipulate graphic objects. In the experiment described herein, three different populations of subjects were asked to communicate with a computer using either speech alone, gestures alone, or both. The task was the manipulation of a three-dimensional cube on the screen. They were asked to assume that the computer could see their hands, hear their voices, and understand their gestures and speech as well as a human could. A gesture classification scheme was developed to analyse the gestures of the subjects. A primary objective of the classification scheme was to determine whether common features would be found among the gestures of different users and classes of users. The collected data show a surprising degree of commonality among subjects in the use of gestures as well as speech. In addition to the uniformity of the observed manipulations, subjects expressed a preference for a combined gesture/speech interface. Furthermore, all subjects easily completed the simulated object manipulation tasks. The results of this research, and of future experiments of this type, can be applied to develop a gesture-based or gesture/speech-based system which enables computer users to manipulate graphic objects using easily learned and intuitive gestures to perform spatial tasks. Such tasks might include editing a three-dimensional rendering, controlling the operation of vehicles or operating virtual tools in three dimensions, or assembling an object from components. Knowledge about how people intuitively use gestures to communicate with computers provides the basis for future development of gesture-based input devices.

© All rights reserved Hauptmann and McAvinney and/or Academic Press

1991
 
Rudnicky, Alexander I. and Hauptmann, Alexander G. (1991): Models for Evaluating Interaction Protocols in Speech Recognition. In: Robertson, Scott P., Olson, Gary M. and Olson, Judith S. (eds.) Proceedings of the ACM CHI 91 Human Factors in Computing Systems Conference April 27 - May 2, 1991, New Orleans, Louisiana. pp. 285-291.

Recognition errors complicate the assessment of speech systems. This paper presents a new approach to modeling spoken language interaction protocols, based on finite Markov chains. An interaction protocol, prescribed by the interface design, defines a set of primitive transaction steps and the order of their execution. The efficiency of an interface depends on the interaction protocol as well as the cost of each different transaction step. Markov chains provide a simple and computationally efficient method for modeling errorful systems. They allow for detailed comparisons between different interaction protocols and between different modalities. The method is illustrated by application to example protocols.

© All rights reserved Rudnicky and Hauptmann and/or ACM Press

 
Prime, Martin, Hauptmann, Alexander G., Chapin, William L. and Fels, Sidney (1991): Experiences with Computer Glove Input Devices. In: Proceedings of the Human Factors Society 35th Annual Meeting 1991. pp. 413-415.

1989
 
Hauptmann, Alexander G. (1989): Speech and Gestures for Graphic Image Manipulation. In: Bice, Ken and Lewis, Clayton H. (eds.) Proceedings of the ACM CHI 89 Human Factors in Computing Systems Conference April 30 - May 4, 1989, Austin, Texas. pp. 241-245.

An experiment was conducted with people using gestures and speech to manipulate graphic images on a computer screen. A human was substituted for the recognition devices. The analysis showed that people strongly prefer to use both gestures and speech for the graphics manipulation and that they intuitively use multiple hands and multiple fingers in all three dimensions. There was surprising uniformity and simplicity in the gestures and speech. The analysis of these results provides strong encouragement for future development of integrated multi-modal interaction systems.

© All rights reserved Hauptmann and/or ACM Press

 
Young, Sheryl R., Hauptmann, Alexander G., Ward, Wayne H., Smith, Edward T. and Werner, Philip (1989): High Level Knowledge Sources in Usable Speech Recognition Systems. In Communications of the ACM, 32 (2) pp. 183-194.

1988
 
Hauptmann, Alexander G. and Rudnicky, Alexander I. (1988): Talking to Computers: An Empirical Investigation. In International Journal of Man-Machine Studies, 28 (6) pp. 583-604.

This paper describes an empirical study of man-computer speech interaction. The goals of the experiment were to find out how people would communicate with a real-time, speaker-independent continuous speech understanding system. The experimental design compared three communication modes: natural language typing, speaking directly to a computer and speaking to a computer through a human interpreter. The results show that speech to a computer is not as ill-formed as one would expect. People speaking to a computer are more disciplined than when speaking to each other. There are significant differences in the usage of spoken language compared to typed language, and several phenomena which are unique to spoken or typed input respectively. Usefulness for work in speech understanding systems for the future is considered.

© All rights reserved Hauptmann and Rudnicky and/or Academic Press

1983
 
Hauptmann, Alexander G. and Green, Bert F. (1983): A Comparison of Command, Menu-Selection and Natural-Language Computer Programs. In Behaviour and Information Technology, 2 (2) pp. 163-178.

An experiment compared three man-machine language interfaces to the same interactive computer program: command language, menu selection and natural language. No significant differences were found between language modes for time, error and attitudes measures. Significant task differences were found for word and line counts as well as for several two-way interactions. The results suggest that the interface to the program (natural language, menu selection or commands) may not be as important as the structure and constraints of the underlying program.

© All rights reserved Hauptmann and Green and/or Taylor and Francis

 

Page Information

Page maintainer: The Editorial Team
URL: http://www.interaction-design.org/references/authors/alexander_g__hauptmann.html