Repository logo
 

CASL

Permanent URI for this collectionhttps://eresearch.qmu.ac.uk/handle/20.500.12289/22

Browse

Search Results

Now showing 1 - 10 of 21
  • Thumbnail Image
    Item
    Improving ultrasound post estimation accuracy by training on co-registered EMA data
    (University of Aizu, 2024-06-28) Balch-Tomes, Jonathan; Wrench, Alan A.; Scobbie, James M.; Macmartin, C.; Turk, A.
    This study aims to assess how accurately DeepLabCut [1], when applied to ultrasound tongue images, can estimate Electromagnetic Articulography (EMA) sensor positions. EMA provides objective measures of anterior tongue, jaw, and lip kinematics. DeepLabCut pose estimation is a powerful method of extracting keypoint positions from midsagittal ultrasound images of the tongue. It has an advantage over EMA in that it can be applied to the whole of the tongue from tip to root as well as the jaw and the hyoid. After correction for probe translation standard error in the estimation of keypoint positions compared to the corresponding EMA sensor positions was 1.2-1.5mm along the tongue contour and 0.5-0.9mm perpendicular to the tongue contour.
  • Thumbnail Image
    Item
    Similar and different tongue surface contours: intra-speaker controls in ultrasound analysis
    (2023-08) Scobbie, James M.
    Ultrasound studies of speech production analyse differences in dependent variables reflecting the tongue surface’s location and shape. Inferential statistics distinguish theoretically-relevant from random effects, somewhat independently of the descriptive size of significant effects. Experimental designs induce measurable dependent changes by manipulating independent variables such as prosody, phonemic target, etc. This paper presents descriptive statistics quantifying holistically all 15 pairwise differences between six monophthongal long vowel phonemes of one variety of English, comparing these to experimental noise differences attributable to the use of two identical blocks of data collection in sequence. Eight speakers were recorded, using two different ultrasound systems, and analysed in AAA using both edge-tracking and DeepLabCut pose estimation. The smallest phonemic contrast (~2mm) was greater than the experimental noise (~1mm), and was well evidenced by AAA’s t-test of radial difference.
  • Thumbnail Image
    Item
    Automated assessment of hyoid movement during normal swallow using ultrasound
    (Royal College of Speech and Language Therapists, 2022-03-14) Ma, Joan K-Y; Wrench, Alan A.
    Background The potential for using ultrasound by speech and language therapists (SLTs) as an adjunct clinical tool to assess swallowing function has received increased attention during the COVID-19 pandemic, with a recent review highlighting the need for further research on normative data, objective measurement, elicitation protocol and training. The dynamic movement of the hyoid, visible in ultrasound, is crucial in facilitating bolus transition and protection of the airway during a swallow and has shown promise as a biomarker of swallowing function.
  • Thumbnail Image
    Item
    Translating ultrasound into clinical practice for the assessment of swallowing and laryngeal function: A speech and language pathology-led consensus study
    (Springer, 2022-02-24) Allen, Jodi E.; Clunie, Gemma; Ma, Joan K-Y; Coffey, Margaret; Winiker, Katharina; Richmond, Sally; Lowell, Soren Y.; Volkmer, Anna
    Ultrasound (US) has an emerging evidence base for the assessment of swallowing and laryngeal function. Accessibility and technological advances support the use of US as a clinical assessment tool; however, there is insufficient evidence to support its translation into clinical practice. This study aimed to establish consensus on the priorities for translation of US into clinical practice for the assessment of swallowing and laryngeal function. Nominal Group Technique (NGT) was used as a formal method of consensus development. Clinicians and academics, all members of an international US working group, were invited to participate in the study. Two NGT meetings were held, where participants silently generated and then shared ideas. Participants anonymously ranked items. Rankings were aggregated before participants re-ranked items in order of priority. Discussions regarding rankings were recorded and transcribed to inform analysis. Member-checking with participants informed the final analysis. Participants (n = 15) were speech and language pathologists, physiotherapists and sonographers representing six countries. Fifteen items were identified and prioritised 1–13 (including two equally ranked items). Reliability, validity and normative data emerged as key areas for research while development of training protocols and engagement with stakeholders were considered vital to progressing US into practice. Analysis revealed common themes that might be addressed together in research, in addition to the ranked priority. A measured approach to the translation of US into clinical practice will enable effective implementation of this tool. Priorities may evolve as clinical and professional contexts shift, but this study provides a framework to advance research and clinical practice in this field.
  • Thumbnail Image
    Item
    Say again? Individual articulatory strategies for producing a clearly-spoken minimal pair wordlist
    (International Phonetic Association, 2019-08-10) Scobbie, James M.; Ma, Joan K-Y
    We describe articulatory differences (lingual and labial) between two versions (neutral and clear) of a CVC wordlist of 12 targets (V = /ieaɔoʉ/; C_C = /p_p/ or /m_m/). A companion paper describes the background; the participants, materials and tasks; the impressionistic and acoustic results. Labial measures reflect vowel opening (and edge-spreading) and consonant compression using fleshpoint markers captured by head-mounted video. Consonant closure and total word duration are based on visual judgement of complete closure. Ultrasound data provides the absolute area between neutral and clear mid-sagittal tongue-surface splines at the maximum of each vowel target, and a qualitative description of tongue shape and location. Strong and systematic interspeaker variation was evident in how articulation, acoustics and functional clarity were enhanced. Some large phonologically motivated segmental hyperspeech enhancements were observed, but they were not related straightforwardly to the phonological oppositions in the material nor consistently used by all speakers. Differences in utterance initiation are also discussed.
  • Thumbnail Image
    Item
    Visualising speech: Identification of atypical tongue-shape patterns in the speech of children with cleft lip and palate using ultrasound technology
    (NHS Greater Glasgow & Clyde and University of Strathclyde, Glasgow, 2018) Lloyd, Susan; Cleland, Joanne; Crampin, Lisa; Campbell, Linsay; Zharkova, Natalia; Palo, Pertti
    Previous research by Gibbon (2004) shows that at least 8 distinct error types can be identified in the speech of people with cleft lip and palate (CLP) using electropalatography (EPG), a technique which measures tongue-palate contact. However, EPG is expensive and logistically difficult. In contrast, ultrasound is cheaper and arguably better equipped to image the posterior articulations (such as pharyngeals) which are common in CLP. A key aim of this project is to determine whether the eight error types made visible with EPG in CLP speech described by Gibbon (2004) can be also be identified with ultrasound. This paper will present the first results from a larger study developing a qualitative and quantitative ultrasound speech assessment protocol. Data from the first 20 children aged 3 to 18 with CLP will be presented. Data are spoken materials from the CLEFTNET protocol. We will present a recording format compatible with CAPS-A to record initial observations from the live ultrasound (e.g. double articulations, pharyngeal stops). Two Speech and Language Therapists analysed the data independently to identify error types. Results suggest that all of the error types, for example fronted placement and double articulations can be identified using ultrasound, but this is challenging in real-time. Ongoing work involves quantitative analysis of error types using articulatory measures.
  • Thumbnail Image
    Item
    Visualising speech: Using ultrasound visual biofeedback to diagnose and treat speech disorders in children with cleft lip and palate
    (NHS Greater Glasgow & Clyde and University of Strathclyde, Glasgow, 2017-09) Cleland, Joanne; Crampin, Lisa; Zharkova, Natalia; Wrench, Alan A.; Lloyd, Susan; Palo, Pertti
    Children with cleft lip and palate (CLP) often continue to have problems producing clear speech long after the clefts have been surgically repaired, leading to educational and social disadvantage. Speech is of key importance in CLP from both a quality of life and surgical outcome perspective, yet assessment relies on subjective perceptual methods, with speech and language therapists (SLTs) listening to speech and transcribing errors. This is problematic because perception-based phonetic transcription is well known to be highly unreliable(Howard & Lohmander, 2011) especially in CLP, where the range of error types is arguably far greater than for other speech sound disorders. Moreover,CLP speech is known to be vulnerable to imperceptible error types, such as double articulations which can only be understood with instrumental techniques such as ultrasound tongue imaging (UTI). Incorrect transcription of these errors can result in misdiagnosis and subsequent inappropriate intervention which can lead to speech errors becoming deeply ingrained.
  • Thumbnail Image
    Item
    Tongue shape dynamics in swallowing using sagittal ultrasound
    (Springer, 2018-06-28) Ohkubo, Mai; Scobbie, James M.
    Ultrasound imaging is simple, repeatable, gives real-time feedback, and its dynamic soft tissue imaging may make it superior to other modalities for swallowing research. We tested this hypothesis and measured certain spatial and dynamic aspects of the swallowing to investigate its efficacy. Eleven healthy adults wearing a headset to stabilize the probe participated in the study. Both thickened and thin liquids were used, and liquid bolus volumes of 10 and 25 ml were administered to the subjects by using a cup. The tongue's surface was traced as a spline superimposed on a fan-shaped measurement space for every image from the time at which the tongue blade started moving up toward the palate at the start of swallowing to the time when the entire tongue was in contact with the palate. To measure depression depth, the distance (in mm) was measured along each radial fan line from the location at which the tongue's surface spline intersected the fan line to the point where the hard palate intersected the fan line at each timepoint. There were differences between individual participants in the imageability of the swallow, and so we defined quantitatively "measureable" and "unmeasurable" types. The most common type was measureable, in which we could find a clear bolus depression in the cupped tongue's surface. Indeed, with 10 ml of thin liquids, we were able to find and measure the depression depth for all participants. The average maximum radial distance from the palate to the tongue's surface was 20.9 mm (median) (IQR: 4.3 mm) for swallowing 10 ml of thin liquid compared to 24.6 mm (IQR: 3.3 mm) for 25 ml of thin liquid swallow (p < 0.001). We conclude that it is possible to use ultrasound imaging of the tongue to capture spatial aspects of swallowing.
  • Thumbnail Image
    Item
    Biological and social grounding of phonology : variation as a research tool
    (2007-08) Scobbie, James M.; Acknowledgements. Financial support from ESRC (RES-000-22-2032).
    Phonological-phonetic sound systems are abstractions away from substance, so while they are grounded in biological capacity, they also reflect phonetically un-natural relationships arising from a variety of linguistic factors. Sociolinguistic variation is one of these non-biological factors. Pilot articulatory results are presented from derhoticised Scottish English. It can have onset/ coda allophony far more radical than the systems that are normally examined in articulatory research. Ultrasound analysis shows acoustic rhoticity in codas may have a post-alveolar constriction so delayed that acoustic rhoticity is covert. Perceptual recoverability of social identity has to be considered in addition to plain phonetic factors.
  • Thumbnail Image
    Item
    Articulation therapy for children with cleft palate using visual articulatory models and ultrasound biofeedback
    (International Phonetic Association, 2015-08-15) Roxburgh, Zoe; Scobbie, James M.; Cleland, Joanne
    Visual biofeedback tools, such as Electropalatography (EPG), are recommended for assessing and treating speech sound disorders (SSDs) associated with Cleft Palate (CP). However, EPG is not suitable for all clients, due to dependencies on stable dentition and timing of palatal repair. Ultrasound is becoming increasingly popular for its use in treating SSDs, with no reports on its dependency on structure of the vocal tract. However its clinical application in the CP population remains to be tested. We compared Visual Articulatory Models (VAMs) with Ultrasound for the treatment of SSDs in two children with repaired submucous CP. Both children received two blocks of therapy each with eight sessions, with the first block using VAMs and the second using ultrasound. Results showed that both children improved overall, with more improvement found in the first block of therapy using VAMs.