|
Programme
|
Written by Fred Nicolls
|
|
Monday, 17 November 2008 11:11 |
|
A print version of the detailed programme is available here. Delegates will receive a printed copy of the proceedings at the conference. An electronic copy is available here. Thursday 27 November | 8:00-9:00 | Registration | | 9:00-9:05 | Welcome | | 9:05-9:50 | Plenary talk: David van Leeuwen The difference that South Africa has made to Speaker Recognition | | 9:50-10:50 | Pattern recognition I | | | Heuristics for State Splitting in Hidden Markov Models Benjamin Murrell and Jules Raymond Tapamo | | | Binary Naive Bayesian classifiers for correlated Gaussian features: A theoretical analysis Ewald van Dyk and Etienne Barnard | | | An Introduction to Diffusion Maps J. de la Porte, B. M. Herbst, W. Hereman, and S. J. van der Walt | | 10:50-11:10 | TEA
| | 11:10-12:50 | Pattern recognition II | | | Ensemble Feature Selection for Hyperspectral Imagery Gidudu, A., Abe, B. and Marwala, T. | | | The hitchhiker's guide to the particle filter McElory Hoffmann, Karin Hunter, and Ben Herbst | | | Impact Assessment of Missing Data Imputation Models< Dan Golding and Tshilidzi Marwala | | | A note on the separability index Linda Mthembu and Tshilidzi Marwala | | | Extending DTGologto Deal with POMDPs Gavin Rens, Alexander Ferrein, and Etienne van der Poel | | 12:50-14:20 | LUNCH
| | 14:20-16:00 | Pattern recognition applications | | | Acoustic cues identifying phonetic transitions for speech segmentation D.R. van Niekerk and E. Barnard | | | Photometric modelling of real-world objects John Morkel and Fred Nicolls | | | Experiments in automatic assessment of oral proficiency and listening comprehension for bilingual South African speakers Febe de Wet, Pieter Muller, Christa van der Walt, and Thomas Niesler | | | Rapid 3D Measurement and Influences on Precision Using Digital Video Cameras Willie van der Merwe and Kristiaan Schreve | | | Evaluating Topic Models with Stability Alta de Waal and Etienne Barnard | | 16:00-17:30 | TEA AND POSTERS | | 19:30- | CONFERENCE DINNER: Simon's Restaurant at Groot Constantia. Sponsored by Oxford Metrics Group. | Friday 28 November | 9:00-9:45 | Plenary talk: Thomas Hain The Careful Listener: Speech Processing in Meetings | | 9:45-10:45 | Parallel track I (Pattern recognition/speech) | | | Porting A Spoken Language Identification SYSTEM to a new environment Marius Peche, Marelie Davel, and Etienne Barnard | | | Relationship between Structural Diversity and Performance of Multiple Classifiers for Decision Support R. Musehane, F. A. Netshiongolwe, L. Masisi, F. V. Nelwamondo, and T. Marwala | | | A channel normalization for speech recognition in mismatched conditions Neil Kleynhans and Etienne Barnard | | | Parallel track I (Imaging and vision) | | | Action Classification using the Average of Pose Changes Janto F. Dreijer and Ben M. Herbst | | | Real-time surface tracking with uncoded structured light Willie Brink | | | Fiducial-based monocular 3D displacement measurement of breakwater armour unit models R. Vieira, F. van den Bergh, and B. J. van Wyk | | 10:45-11:05 | TEA
| | 11:05-12:45 | Parallel track II (Speech) | | | Data requirements for speaker independent acoustic models Jacob A. C. Badenhorst and Marelie Davel | | | Acoustic analysis of diphthongs in Standard South African English Olga Martirosian and Marelie Davel | | | The origin of the Afrikaans pronunciation: a comparison to west Germanic languages and Dutch dialects Wilbert Heeringa and Febe de Wet | | | Speect: a multilingual text-to-speech system J. A. Louw | | | Homophone Disambiguation in Afrikaans Hendrik J. Groenewald and Marissa van Rooyen | | 11:05-12:45 | Parallel track II (Imaging and vision) | | | 3D Phase Unwrapping of DENSE MRI Images Using Region Merging Joash N. Ongori, Ernesta M. Meintjes, and Bruce S. Spottiswoode | | | Fast Calculation of Digitally Reconstructed Radiographs using Light Fields Cobus Carstens and Neil Muller | | | Traffic sign detection and classification using colour and shape cues F. P. Senekal | | | Hough Transform Tuned Bayesian Classifier for Overhead Power Line Inspection Z. R. S. Gaspar, Shengzhi Du, and B. J. van Wyk | | | Alignment invariant image comparison implemented on the GPU Hans Roos, Yuko Roodt, and Willem A. Clarke | | 12:45-13:00 | Annual general meeting | | 13:00-13:30 | LUNCH | |
|
Last Updated ( Monday, 24 November 2008 12:16 )
|
|
|
Written by Fred Nicolls
|
|
Thursday, 13 November 2008 12:21 |
|
The conference dinner will be held at Simon's restaurant at Groot Constantia. Delegates attending the dinner are required to select one item from each course in the restaurant menu (available here) at least one week in advance. Please email your selections to Karin van Wyk. |
|
Last Updated ( Thursday, 13 November 2008 12:28 )
|
|
Written by Fred Nicolls
|
|
Wednesday, 12 November 2008 09:14 |
Thursday 27 NovemberDavid van Leeuwen: The difference that South Africa has made to Speaker Recognition Automatic speaker recognition is an area of speech technology that has received much attention from speech researchers in recent years. Some believe that it is the cleanest of all speech related recognition problems. Although simple in its formulation, the speaker recognition problem appears to have an intricate relation with its application. Text independent Speaker Recognition can be seen as a pattern recognition problem, where features are highly variable sequences related to a single source. The task is to detect whether the source is of known identity.
The engineering of speaker recognition systems depends largely on the availability of example material, in this case speech recordings of thousands of different speakers. Performance is driven by international benchmark evaluations which have been carried out almost every year since 1996 by the National Institute of Standards and Technology in the United States. These competitive evaluations donate new evaluation data to the research community, which guides research directions. In recent years, the challenge of channel and session variability has been the focus of these evaluations.
In this presentation, the typical characteristics of the speaker recognition approach are reviewed, and an overview of the machine learning techniques employed is given.
In recent years, new approaches to the presentation of the speaker recognition output have been developed. This way of presentation makes the technology applicable in a wider range of applications without the need of recalibration. Both in the attempts to overcome channel variability and the application-independent presentation of speaker recognition output researchers from South Africa have played an important role. Friday 28 November Thomas Hain: The Careful Listener: Speech Processing in Meetings Meetings form an essential part of life for many people and the time spent in face to face meetings is ever increasing while more and more people complain about inefficiency, lack of planning and loss of information. Meetings have to be postponed due to lack of information at the time, essential participants that could not attend or deviation from the real topics at hand. While we are normally very eager to use tools that help to increase productivity in many areas, meetings seem to have been mostly excluded in this quest.
Under the AMI and AMIDA projects observant technologies are developed that aim to assist humans in their tasks rather than replacing their functions. While many of these technologies use several modalities (such as video, speech, handwriting, etc) at the same time, the most important information to date can be derived from speech signals alone. However, most known algorithms have to be altered to cope with the complex acoustic situation and special information not relevant in other domains can be derived.
In this presentation a brief overview of the AMIDA project is given, followed by a discussion of required information for several applications. The information related to speech signals are the speakers identity and location, the timing, the content, the presentation style. Hence speaker diarisation, speaker tracking, and speech recognition are at the core of speech technologies used. The presentation will give an overview of state of the art systems for meetings and their performance. Since processing should be minimally invasive microphone array processing is fundamental to all systems presented. Examples of systems for higher level information extraction using the output of these speech processing algorithms are given. |
|
Last Updated ( Wednesday, 12 November 2008 09:27 )
|
|
Written by Fred Nicolls
|
|
Friday, 07 November 2008 13:58 |
Papers accepted for oral presentationsThese papers will appear in full in the conference proceedings. Any revisions (optional or required) to papers must be received by the end of Wednesday 12 November. Each oral will be allocated 20 minutes, including questions. 2. A Radial Basis Function Neural Network for Facial Classification to aid in Fetal Alcohol Syndrome Diagnosis (withdrawn)
| | 3 - Rapid 3D Measurement and Influences on Precision Using Digital Video Cameras | 6 - Ensemble Feature Selection for Hyperspectral Imagery
| | 7 - Data requirements for speaker independent acoustic models | | 9 - Action Classification using the Average of Pose Changes | | 13 - Fast Calculation of Digitally Reconstructed Radiographs using Light Fields | | 14 - Relationship between Structural Diversity and Performance of Multiple Classifiers for Decision Support | | 16 - Fiducial-based monocular 3D displacement measurement of breakwater armour unit models | | 17 - A channel normalization for speech recognition in mismatched conditions | | 19 - Traffic sign detection and classification using colour and shape cues | 20 - Heuristics for State Splitting in Hidden Markov Models
| | 22 - Acoustic cues identifying phonetic transitions for speech segmentation | | 23 - Hough Transform Tuned Bayesian Classifier for Overhead Power Line Inspection | 25 - An Introduction to Diffusion Maps
| 26 - The hitchhiker's guide to the particle filter
| | 27 - The origin of the Afrikaans pronunciation: a comparison to west Germanic languages and Dutch dialects | 28 - Impact Assessment of Missing Data Imputation Models
| 29 - Binary Naive Bayesian classifiers for correlated Gaussian features: A theoretical analysis
| 30 - 3D Phase Unwrapping of DENSE MRI Images Using Region Merging
| | 31 - Alignment invariant image comparison implemented on the GPU | | 32 - A note on the separability index | 34 - Extending DTGologto Deal with POMDPs
| | 35 - Acoustic analysis of diphthongs in Standard South African English | | 36 - Experiments in automatic assessment of oral proficiency and listening comprehension for bilingual South African speakers | 37 - Real-time surface tracking with uncoded structured light
| 39 - Homophone Disambiguation in Afrikaans
| 40 - Evaluating Topic Models with Stability
| | 41 - Speect: a multilingual text-to-speech system | | 44 - Photometric modelling of real-world objects |
Papers accepted for work-in-progress poster presentationsAbstracts of these papers will appear in the conference proceedings, and an electronic document containing these papers will be made available. Posters should not exceed A0 in size, but the format is otherwise unspecified. | 4. Segmentation of Candidate Bacillus Objects in Ziehl Neelsen Stained Sputum Images Using Deformable Models | | 5. Improving Iris-based Personal Identification using Maximum Rectangular Region Detection | | 8. A Shader-based GPU Implementation of the Fast Fourier Transform | | 10. Assessing the impact of missing data using computational intelligence and decision forest | | 11. An optimised parametric speech synthesis model based on Linear prediction (LP) and the Harmonic plus noise model (HNM) | | 12. Inductive Reasoning in Description Logics | | 15. Impact of data imputation on decision support | | 18. Evaluating techniques to binarize historic cosmic-ray data | | 24. The Kernel Fisher Discriminant for learning bioinformatic data sets. | | 33. A GPU-customized visual hull reconstruction algorithm for real-time applications | 38. A Readability Formula for Afrikaans
| | 42. Porting A Spoken Language Identification SYSTEM to a new environment | | 43. Effects of the Type of Missingness of Data on Artifical INtelligence Predition |
|
|
Last Updated ( Monday, 17 November 2008 13:15 )
|
|
|
|
|
|
|
|
Important dates
<< January 2009 >>
| Mo | Tu | We | Th | Fr | Sa | Su |
| | | | 1 | 2 | 3 | 4 |
| 5 | 6 | 7 | 8 | 9 | 10 | 11 |
| 12 | 13 | 14 | 15 | 16 | 17 | 18 |
| 19 | 20 | 21 | 22 | 23 | 24 | 25 |
| 26 | 27 | 28 | 29 | 30 | 31 | |
|