Home Programme
Programme
Detailed programme PDF Print E-mail
Written by Fred Nicolls   
Monday, 17 November 2008 11:11

A print version of the detailed programme is available here.

Delegates will receive a printed copy of the proceedings at the conference.  An electronic copy is available here.

Thursday 27 November

8:00-9:00 Registration
9:00-9:05 Welcome
9:05-9:50 Plenary talk: David van Leeuwen
The difference that South Africa has made to Speaker Recognition
9:50-10:50 Pattern recognition I
  Heuristics for State Splitting in Hidden Markov Models
Benjamin Murrell and Jules Raymond Tapamo
  Binary Naive Bayesian classifiers for correlated Gaussian features: A theoretical analysis
Ewald van Dyk and Etienne Barnard
  An Introduction to Diffusion Maps
J. de la Porte, B. M. Herbst, W. Hereman, and S. J. van der Walt
10:50-11:10 TEA
11:10-12:50 Pattern recognition II
  Ensemble Feature Selection for Hyperspectral Imagery
Gidudu, A., Abe, B. and Marwala, T.
  The hitchhiker's guide to the particle filter
McElory Hoffmann, Karin Hunter, and Ben Herbst
  Impact Assessment of Missing Data Imputation Models<
Dan Golding and Tshilidzi Marwala
  A note on the separability index
Linda Mthembu and Tshilidzi Marwala
  Extending DTGologto Deal with POMDPs
Gavin Rens, Alexander Ferrein, and Etienne van der Poel
12:50-14:20 LUNCH
14:20-16:00 Pattern recognition applications
  Acoustic cues identifying phonetic transitions for speech segmentation
D.R. van Niekerk and E. Barnard
  Photometric modelling of real-world objects
John Morkel and Fred Nicolls
  Experiments in automatic assessment of oral proficiency and listening comprehension for bilingual South African speakers
Febe de Wet, Pieter Muller, Christa van der Walt, and Thomas Niesler
  Rapid 3D Measurement and Influences on Precision Using Digital Video Cameras
Willie van der Merwe and Kristiaan Schreve
  Evaluating Topic Models with Stability
Alta de Waal and Etienne Barnard
16:00-17:30 TEA AND POSTERS
19:30- CONFERENCE DINNER: Simon's Restaurant at Groot Constantia.  Sponsored by Oxford Metrics Group.

Friday 28 November

9:00-9:45 Plenary talk: Thomas Hain
The Careful Listener: Speech Processing in Meetings
9:45-10:45 Parallel track I (Pattern recognition/speech)
  Porting A Spoken Language Identification SYSTEM to a new environment
Marius Peche, Marelie Davel, and Etienne Barnard
  Relationship between Structural Diversity and Performance of Multiple Classifiers for Decision Support
R. Musehane, F. A. Netshiongolwe, L. Masisi, F. V. Nelwamondo, and T. Marwala
  A channel normalization for speech recognition in mismatched conditions
Neil Kleynhans and Etienne Barnard
  Parallel track I (Imaging and vision)
  Action Classification using the Average of Pose Changes
Janto F. Dreijer and Ben M. Herbst
  Real-time surface tracking with uncoded structured light
Willie Brink
  Fiducial-based monocular 3D displacement measurement of breakwater armour unit models
R. Vieira, F. van den Bergh, and B. J. van Wyk
10:45-11:05 TEA
11:05-12:45 Parallel track II (Speech)
  Data requirements for speaker independent acoustic models
Jacob A. C. Badenhorst and Marelie Davel
  Acoustic analysis of diphthongs in Standard South African English
Olga Martirosian and Marelie Davel
  The origin of the Afrikaans pronunciation: a comparison to west Germanic languages and Dutch dialects
Wilbert Heeringa and Febe de Wet
  Speect: a multilingual text-to-speech system
J. A. Louw
  Homophone Disambiguation in Afrikaans
Hendrik J. Groenewald and Marissa van Rooyen
11:05-12:45 Parallel track II (Imaging and vision)
  3D Phase Unwrapping of DENSE MRI Images Using Region Merging
Joash N. Ongori, Ernesta M. Meintjes, and Bruce S. Spottiswoode
  Fast Calculation of Digitally Reconstructed Radiographs using Light Fields
Cobus Carstens and Neil Muller
  Traffic sign detection and classification using colour and shape cues
F. P. Senekal
  Hough Transform Tuned Bayesian Classifier for Overhead Power Line Inspection
Z. R. S. Gaspar, Shengzhi Du, and B. J. van Wyk
  Alignment invariant image comparison implemented on the GPU
Hans Roos, Yuko Roodt, and Willem A. Clarke
12:45-13:00 Annual general meeting
13:00-13:30 LUNCH
Last Updated ( Monday, 24 November 2008 12:16 )
 
Conference dinner PDF Print E-mail
Written by Fred Nicolls   
Thursday, 13 November 2008 12:21

The conference dinner will be held at Simon's restaurant at Groot Constantia.

Delegates attending the dinner are required to select one item from each course in the restaurant menu (available here) at least one week in advance.  Please email your selections to Karin van Wyk. 

Last Updated ( Thursday, 13 November 2008 12:28 )
 
Invited talks PDF Print E-mail
Written by Fred Nicolls   
Wednesday, 12 November 2008 09:14

Thursday 27 November

David van Leeuwen:  The difference that South Africa has made to Speaker Recognition

 Automatic speaker recognition is an area of speech technology that has
received much attention from speech researchers in recent years.  Some
believe that it is the cleanest of all speech related recognition
problems.  Although simple in its formulation, the speaker recognition
problem appears to have an intricate relation with its application.
Text independent Speaker Recognition can be seen as a pattern
recognition problem, where features are highly variable sequences
related to a single source.  The task is to detect whether the source
is of known identity.

The engineering of speaker recognition systems depends largely on the
availability of example material, in this case speech recordings of
thousands of different speakers.  Performance is driven by
international benchmark evaluations which have been carried out almost
every year since 1996 by the National Institute of Standards and
Technology in the United States.  These competitive evaluations
donate new evaluation data to the research community, which guides
research directions.  In recent years, the challenge of channel and
session variability has been the focus of these evaluations.

In this presentation, the typical characteristics of the speaker
recognition approach are reviewed, and an overview of the machine
learning techniques employed is given.

In recent years, new approaches to the presentation of the speaker
recognition output have been developed.  This way of presentation makes
the technology applicable in a wider range of applications without the
need of recalibration.  Both in the attempts to overcome channel
variability and the application-independent presentation of speaker
recognition output researchers from South Africa have played an
important role.

 

Friday 28 November

Thomas Hain: The Careful Listener: Speech Processing in Meetings

 Meetings form an essential part of life for many people and the time spent in face to face meetings is ever increasing while more and more people complain about inefficiency, lack of planning and loss of information. Meetings have to be postponed due to lack of information at the time, essential participants that could not attend or deviation from the real topics at hand. While we are normally very eager to use tools that help to increase productivity in many areas, meetings seem to have been mostly excluded in this quest.

Under the AMI and AMIDA projects observant technologies are developed that aim to assist humans in their tasks rather than replacing their functions. While many of these technologies use several modalities (such as video, speech, handwriting, etc) at the same time, the most important information to date can be derived from speech signals alone. However, most known algorithms have to be altered to cope with the complex acoustic situation and special information not relevant in other domains can be derived.

In this presentation a brief overview of the AMIDA project is given, followed by a discussion of required information for several applications. The information related to speech signals are the speakers identity and location, the timing, the content, the presentation style. Hence speaker diarisation, speaker tracking, and speech recognition are at the core of speech technologies used. The presentation will give an overview of state of the art systems for meetings and their performance. Since processing should be minimally invasive microphone array processing is fundamental to all systems presented. Examples of systems for higher level information extraction using the output of these speech processing algorithms are given.
 

Last Updated ( Wednesday, 12 November 2008 09:27 )
 
Papers accepted PDF Print E-mail
Written by Fred Nicolls   
Friday, 07 November 2008 13:58

Papers accepted for oral presentations

These papers will appear in full in the conference proceedings.  Any revisions (optional or required) to papers must be received by the end of Wednesday 12 November.  Each oral will be allocated 20 minutes, including questions.

2. A Radial Basis Function Neural Network for Facial Classification to aid in Fetal Alcohol Syndrome Diagnosis (withdrawn)
3 - Rapid 3D Measurement and Influences on Precision Using Digital Video Cameras
6 - Ensemble Feature Selection for Hyperspectral Imagery
7 - Data requirements for speaker independent acoustic models
9 - Action Classification using the Average of Pose Changes
13 - Fast Calculation of Digitally Reconstructed Radiographs using Light Fields
14 - Relationship between Structural Diversity and Performance of Multiple Classifiers for Decision Support
16 - Fiducial-based monocular 3D displacement measurement of breakwater armour unit models
17 - A channel normalization for speech recognition in mismatched conditions
19 - Traffic sign detection and classification using colour and shape cues
20 - Heuristics for State Splitting in Hidden Markov Models
22 - Acoustic cues identifying phonetic transitions for speech segmentation
23 - Hough Transform Tuned Bayesian Classifier for Overhead Power Line Inspection
25 - An Introduction to Diffusion Maps
26 - The hitchhiker's guide to the particle filter
27 - The origin of the Afrikaans pronunciation: a comparison to west Germanic languages and Dutch dialects
28 - Impact Assessment of Missing Data Imputation Models
29 - Binary Naive Bayesian classifiers for correlated Gaussian features: A theoretical analysis
30 - 3D Phase Unwrapping of DENSE MRI Images Using Region Merging
31 - Alignment invariant image comparison implemented on the GPU
32 - A note on the separability index
34 - Extending DTGologto Deal with POMDPs
35 - Acoustic analysis of diphthongs in Standard South African English
36 - Experiments in automatic assessment of oral proficiency and listening comprehension for bilingual South African speakers
37 - Real-time surface tracking with uncoded structured light
39 - Homophone Disambiguation in Afrikaans
40 - Evaluating Topic Models with Stability
41 - Speect: a multilingual text-to-speech system
44 - Photometric modelling of real-world objects

 

Papers accepted for work-in-progress poster presentations

Abstracts of these papers will appear in the conference proceedings, and an electronic document containing these papers will be made available.  Posters should not exceed A0 in size, but the format is otherwise unspecified.

4. Segmentation of Candidate Bacillus Objects in Ziehl Neelsen Stained Sputum Images Using Deformable Models
5. Improving Iris-based Personal Identification using Maximum Rectangular Region Detection
8. A Shader-based GPU Implementation of the Fast Fourier Transform
10. Assessing the impact of missing data using computational intelligence and decision forest
11. An optimised parametric speech synthesis model based on Linear prediction (LP) and the Harmonic plus noise model (HNM)
12. Inductive Reasoning in Description Logics
15. Impact of data imputation on decision support
18. Evaluating techniques to binarize historic cosmic-ray data
24. The Kernel Fisher Discriminant for learning bioinformatic data sets.
33. A GPU-customized visual hull reconstruction algorithm for real-time applications
38. A Readability Formula for Afrikaans
42. Porting A Spoken Language Identification SYSTEM to a new environment
43. Effects of the Type of Missingness of Data on Artifical INtelligence Predition

 

Last Updated ( Monday, 17 November 2008 13:15 )
 


 

Important dates

<<  January 2009  >>
 Mo  Tu  We  Th  Fr  Sa  Su 
     1  2  3  4
  5  6  7  8  91011
12131415161718
19202122232425
262728293031