Repository logo
 

CASL

Permanent URI for this collectionhttps://eresearch.qmu.ac.uk/handle/20.500.12289/22

Browse

Search Results

Now showing 1 - 10 of 24
  • Thumbnail Image
    Item
    Improving ultrasound post estimation accuracy by training on co-registered EMA data
    (University of Aizu, 2024-06-28) Balch-Tomes, Jonathan; Wrench, Alan A.; Scobbie, James M.; Macmartin, C.; Turk, A.
    This study aims to assess how accurately DeepLabCut [1], when applied to ultrasound tongue images, can estimate Electromagnetic Articulography (EMA) sensor positions. EMA provides objective measures of anterior tongue, jaw, and lip kinematics. DeepLabCut pose estimation is a powerful method of extracting keypoint positions from midsagittal ultrasound images of the tongue. It has an advantage over EMA in that it can be applied to the whole of the tongue from tip to root as well as the jaw and the hyoid. After correction for probe translation standard error in the estimation of keypoint positions compared to the corresponding EMA sensor positions was 1.2-1.5mm along the tongue contour and 0.5-0.9mm perpendicular to the tongue contour.
  • Thumbnail Image
    Item
    Similar and different tongue surface contours: intra-speaker controls in ultrasound analysis
    (2023-08) Scobbie, James M.
    Ultrasound studies of speech production analyse differences in dependent variables reflecting the tongue surface’s location and shape. Inferential statistics distinguish theoretically-relevant from random effects, somewhat independently of the descriptive size of significant effects. Experimental designs induce measurable dependent changes by manipulating independent variables such as prosody, phonemic target, etc. This paper presents descriptive statistics quantifying holistically all 15 pairwise differences between six monophthongal long vowel phonemes of one variety of English, comparing these to experimental noise differences attributable to the use of two identical blocks of data collection in sequence. Eight speakers were recorded, using two different ultrasound systems, and analysed in AAA using both edge-tracking and DeepLabCut pose estimation. The smallest phonemic contrast (~2mm) was greater than the experimental noise (~1mm), and was well evidenced by AAA’s t-test of radial difference.
  • Thumbnail Image
    Item
    Automated assessment of hyoid movement during normal swallow using ultrasound
    (Royal College of Speech and Language Therapists, 2022-03-14) Ma, Joan K-Y; Wrench, Alan A.
    Background The potential for using ultrasound by speech and language therapists (SLTs) as an adjunct clinical tool to assess swallowing function has received increased attention during the COVID-19 pandemic, with a recent review highlighting the need for further research on normative data, objective measurement, elicitation protocol and training. The dynamic movement of the hyoid, visible in ultrasound, is crucial in facilitating bolus transition and protection of the airway during a swallow and has shown promise as a biomarker of swallowing function.
  • Thumbnail Image
    Item
    Translating ultrasound into clinical practice for the assessment of swallowing and laryngeal function: A speech and language pathology-led consensus study
    (Springer, 2022-02-24) Allen, Jodi E.; Clunie, Gemma; Ma, Joan K-Y; Coffey, Margaret; Winiker, Katharina; Richmond, Sally; Lowell, Soren Y.; Volkmer, Anna
    Ultrasound (US) has an emerging evidence base for the assessment of swallowing and laryngeal function. Accessibility and technological advances support the use of US as a clinical assessment tool; however, there is insufficient evidence to support its translation into clinical practice. This study aimed to establish consensus on the priorities for translation of US into clinical practice for the assessment of swallowing and laryngeal function. Nominal Group Technique (NGT) was used as a formal method of consensus development. Clinicians and academics, all members of an international US working group, were invited to participate in the study. Two NGT meetings were held, where participants silently generated and then shared ideas. Participants anonymously ranked items. Rankings were aggregated before participants re-ranked items in order of priority. Discussions regarding rankings were recorded and transcribed to inform analysis. Member-checking with participants informed the final analysis. Participants (n = 15) were speech and language pathologists, physiotherapists and sonographers representing six countries. Fifteen items were identified and prioritised 1–13 (including two equally ranked items). Reliability, validity and normative data emerged as key areas for research while development of training protocols and engagement with stakeholders were considered vital to progressing US into practice. Analysis revealed common themes that might be addressed together in research, in addition to the ranked priority. A measured approach to the translation of US into clinical practice will enable effective implementation of this tool. Priorities may evolve as clinical and professional contexts shift, but this study provides a framework to advance research and clinical practice in this field.
  • Item
    Systematic review of ultrasound visual biofeedback in intervention for speech sound disorders
    (Wiley, 2019-06-10) Sugden, Eleanor; Lloyd, Susan; Lam, Jenny; Cleland, Joanne
    Background As cost and access barriers to ultrasound technology have decreased, interest in using ultrasound visual biofeedback (U-VBF) as a tool for remediating speech sound disorders (SSD) has increased. A growing body of research has investigated U-VBF in intervention for developmental SSD; however, diversity in study design, participant characteristics, clinical methods and outcomes complicate the interpretation of this literature. Thus, there is a need for a synthesis and review of the evidence base for using U-VBF in intervention for SSD.
  • Thumbnail Image
    Item
    Say again? Individual articulatory strategies for producing a clearly-spoken minimal pair wordlist
    (International Phonetic Association, 2019-08-10) Scobbie, James M.; Ma, Joan K-Y
    We describe articulatory differences (lingual and labial) between two versions (neutral and clear) of a CVC wordlist of 12 targets (V = /ieaɔoʉ/; C_C = /p_p/ or /m_m/). A companion paper describes the background; the participants, materials and tasks; the impressionistic and acoustic results. Labial measures reflect vowel opening (and edge-spreading) and consonant compression using fleshpoint markers captured by head-mounted video. Consonant closure and total word duration are based on visual judgement of complete closure. Ultrasound data provides the absolute area between neutral and clear mid-sagittal tongue-surface splines at the maximum of each vowel target, and a qualitative description of tongue shape and location. Strong and systematic interspeaker variation was evident in how articulation, acoustics and functional clarity were enhanced. Some large phonologically motivated segmental hyperspeech enhancements were observed, but they were not related straightforwardly to the phonological oppositions in the material nor consistently used by all speakers. Differences in utterance initiation are also discussed.
  • Thumbnail Image
    Item
    Visualising speech: Identification of atypical tongue-shape patterns in the speech of children with cleft lip and palate using ultrasound technology
    (NHS Greater Glasgow & Clyde and University of Strathclyde, Glasgow, 2018) Lloyd, Susan; Cleland, Joanne; Crampin, Lisa; Campbell, Linsay; Zharkova, Natalia; Palo, Pertti
    Previous research by Gibbon (2004) shows that at least 8 distinct error types can be identified in the speech of people with cleft lip and palate (CLP) using electropalatography (EPG), a technique which measures tongue-palate contact. However, EPG is expensive and logistically difficult. In contrast, ultrasound is cheaper and arguably better equipped to image the posterior articulations (such as pharyngeals) which are common in CLP. A key aim of this project is to determine whether the eight error types made visible with EPG in CLP speech described by Gibbon (2004) can be also be identified with ultrasound. This paper will present the first results from a larger study developing a qualitative and quantitative ultrasound speech assessment protocol. Data from the first 20 children aged 3 to 18 with CLP will be presented. Data are spoken materials from the CLEFTNET protocol. We will present a recording format compatible with CAPS-A to record initial observations from the live ultrasound (e.g. double articulations, pharyngeal stops). Two Speech and Language Therapists analysed the data independently to identify error types. Results suggest that all of the error types, for example fronted placement and double articulations can be identified using ultrasound, but this is challenging in real-time. Ongoing work involves quantitative analysis of error types using articulatory measures.
  • Thumbnail Image
    Item
    Visualising speech: Using ultrasound visual biofeedback to diagnose and treat speech disorders in children with cleft lip and palate
    (NHS Greater Glasgow & Clyde and University of Strathclyde, Glasgow, 2017-09) Cleland, Joanne; Crampin, Lisa; Zharkova, Natalia; Wrench, Alan A.; Lloyd, Susan; Palo, Pertti
    Children with cleft lip and palate (CLP) often continue to have problems producing clear speech long after the clefts have been surgically repaired, leading to educational and social disadvantage. Speech is of key importance in CLP from both a quality of life and surgical outcome perspective, yet assessment relies on subjective perceptual methods, with speech and language therapists (SLTs) listening to speech and transcribing errors. This is problematic because perception-based phonetic transcription is well known to be highly unreliable(Howard & Lohmander, 2011) especially in CLP, where the range of error types is arguably far greater than for other speech sound disorders. Moreover,CLP speech is known to be vulnerable to imperceptible error types, such as double articulations which can only be understood with instrumental techniques such as ultrasound tongue imaging (UTI). Incorrect transcription of these errors can result in misdiagnosis and subsequent inappropriate intervention which can lead to speech errors becoming deeply ingrained.
  • Item
    The dynamics of voiceless sibilant fricative production in children between 7 and 13 years old: An ultrasound and acoustic study
    (Acoustical Society of America (ASA), 2018-09-19) Zharkova, Natalia; Hardcastle, William J.; Gibbon, Fiona E.
    This study reports on dynamic tongue shape and spectral characteristics of sibilant fricatives /s/ and /ʃ/ in Scottish English speaking children aged between 7 and 13 years old. The sequences /əCa/ and /əCi/ were produced by 40 children, with ten participants in each age group, and two-year intervals between successive groups. Productions of the same sequences by ten adults were used for comparison with the children's data. Quantitative dynamic analyses were carried out on spectral information and on ultrasound imaging data on tongue shape. All age groups differentiated between the two consonants in the fricative centroid and in tongue shape. Vowel-on-consonant effects showed consonant-specific patterns across age groups without a consistent increase or decrease in the extent of coarticulation with increasing age. The extent of discriminability between the two fricatives increased with age on both acoustic and articulatory measures. Younger speakers were generally more variable than older speakers. Complementary findings from the centroid and tongue shape measures suggest that age-related differences are due to the ongoing maturation of controlling the tongue in coordination with other articulators, particularly the jaw, throughout childhood.
  • Thumbnail Image
    Item
    Tongue shape dynamics in swallowing using sagittal ultrasound
    (Springer, 2018-06-28) Ohkubo, Mai; Scobbie, James M.
    Ultrasound imaging is simple, repeatable, gives real-time feedback, and its dynamic soft tissue imaging may make it superior to other modalities for swallowing research. We tested this hypothesis and measured certain spatial and dynamic aspects of the swallowing to investigate its efficacy. Eleven healthy adults wearing a headset to stabilize the probe participated in the study. Both thickened and thin liquids were used, and liquid bolus volumes of 10 and 25 ml were administered to the subjects by using a cup. The tongue's surface was traced as a spline superimposed on a fan-shaped measurement space for every image from the time at which the tongue blade started moving up toward the palate at the start of swallowing to the time when the entire tongue was in contact with the palate. To measure depression depth, the distance (in mm) was measured along each radial fan line from the location at which the tongue's surface spline intersected the fan line to the point where the hard palate intersected the fan line at each timepoint. There were differences between individual participants in the imageability of the swallow, and so we defined quantitatively "measureable" and "unmeasurable" types. The most common type was measureable, in which we could find a clear bolus depression in the cupped tongue's surface. Indeed, with 10 ml of thin liquids, we were able to find and measure the depression depth for all participants. The average maximum radial distance from the palate to the tongue's surface was 20.9 mm (median) (IQR: 4.3 mm) for swallowing 10 ml of thin liquid compared to 24.6 mm (IQR: 3.3 mm) for 25 ml of thin liquid swallow (p < 0.001). We conclude that it is possible to use ultrasound imaging of the tongue to capture spatial aspects of swallowing.