Narrative Discourse Productions in Older Language Impaired Learning Disabled Children: Employing Stricter Reliability MeasuresLuanne Henshilwood and Dale OgilvyDepartment of Logopaedics
University of Cape Town







 
ABSTRACT

This study aimed to describe narrative discourse productions of language impaired learning disabled (LILD) persons, using stringent reliability measures. Coherence and cohesion were the measures of analysis employed. Interrater and intrarater reliability measures were calculated and testing for stability of scores across three testing sessions was undertaken. Content and clarity ratings provided a subjective analysis of narrative productions. The results indicated subtle differences in the coherence and cohesion of narrative productions in the LILD compared with controls. The findings of this study support past literature, which calls for greater research in this area using stricter reliability measures.

OPSOMMING
Daar is in  hierdie navorsing  gepoog om narratiewe diskoersproduksie van leergestreemde persone met ‘n taalafwyking te beskryf, met die gebriuk van streng betroubaarheidsmaatreёls. Koherensie an kohesie is die matreёls van analise wat gebruik is. Inter-beskouer  en intrabeskouer- betroubaarheidsmetings is verwerk en toesting vir stabiliteit van punte oor drie toetsessies onderneem. Subjektiewe analise van narratiewe produksies is deur inhoudsberekeninge voorsien. Die resultate het subtiele verskille in die koherensie en kohesie van diskoers in die leergestremde persone met ‘n taalafwyking , in vergelyking met die kontrolegroep, aangedui. Die bevindinge van hierdie narvorsing ondersteun vroeёr literatuurstudies,naamlik dat verdere narvorsing in hierdie area meer doeltreffende betroubaarheidsmaatreёls sal moet gebruik.
Key words: Narrative discourse, older language impaired learning disabled
                     children, reliability measures in discourse.
INTRODUCTION
If narrative is an organiser of human experience, what better and more relevant mode of expression can we use to probe the abilities and disorders that shape human communication?                (Bruner, 1990,pg 286)

       Assessment of discourse in older children and adolescents allows a clinician to examine a child’s ability to manage larger units of discourse as well as to examine their ability to process coherent oral and written texts (Nelson, 1993). Learning disabled children have been found to present with problems in discourse forms, which either supersede linguistic deficits or occur in the absence of structural linguistic problems (Roth and Spekman, 1986).
       According to Klein (1991) learning disabled children may find it difficult to communicate in a free flowing, creative and interesting manner. Children who present with expressive difficulties may produce circumlocutions and confabulations while searching for the correct word, while others may talk endlessly but their language is empty and repetitive (Klein, 1991).
       The popularity of obtaining and analysing samples of children’s discourse in combination with traditional language assessment procedures has increased over the past two decades (Gallagher, 1983; Stickler, 1987; Owens, 1995 in Hux, Sanger, Ried and Maschka, 1997). The popularity can be attributed to the limitations of standardised tests and the limited information these tests provide a clinician ( Hux, Sanger, Ried and Maschka, 1997). Narrative discourse tasks have been found to be good instruments to assess higher level language and cognitive skills (Paul and Smith, 1993).
       Past research on narrative abilities in learning disabled children has presented with many contradictory results. Roth and Spekman (1986) and Ripich and Griffith (1988) found that learning disabled students perform poorer than normal peers in some aspects of narrative productions, but in other aspects perform equally well. Other studies have stated that learning disabled children produce less cohesive narratives than non-learning disabled persons with the main difference being attributed to the use of pronouns as referents and conjunctions as tie elements (Strong and Shaver, 1991; Liles, Duffy, Merritt & Purcell, 1995).
       On the other hand, some studies have found that in many aspects learning disabled children were not found to produce narratives that are significantly different to those of normal children. Ripich and Griffith (1988) found that children with learning disabilities are able to organise their stories according to an appropriate story grammar. Liles et al. (1995) support these findings as they state that learning disabled children appear to be using, or attempt to use information typical of a normally developing child regarding production of narrative discourse. In addition, past studies have shown that the differences between the language impaired learning disabled and control groups are small and insignificant or none at all (Ripich and Griffith, 1988). This indicates there are often more similarities between the groups than differences (Roth and Spekman, 1986).
       Strong and Shaver (1991) attribute the conflicting results to the unreliability of scores given the analysed elements of the discourse. With all the attention being placed on the method of sampling language, there is still little agreement as to the most reliable and valid procedure (Morris-Friehe and Sanger, 1992).
       
       In more recent studies validity and reliability have become increasingly important aspects to language sampling and analysis of discourse productions (Strong and Shaver, 1991; Morris-Friehe and Sanger; 1992 Hux et al. 1997). Hux et al. (1997) highlight many methodological issues that can contribute to the reduced reliability in such research. The variability of language across context and tasks, the diverse nature of language, little normative data on older children and the heterogeneous nature of clients all contribute to the controversy in this area of research (Morris- Friehe and Sanger, 1992; Nelson, 1993; Hux et al., 1997).
       Reliability becomes an important concern when discourse analysis highlights the pragmatic nature of language resulting in analysis that sometimes offers vague descriptions (Hux et al., 1997). In order for results to be interpreted with confidence, researchers have collected data across a number of different testing times spaced at brief intervals to determine whether the scores do not fluctuate greatly across testing sessions (Strong and Shaver, 1991; Morris-Friehe and Sanger, 1992). Strong and Shaver (1991) further recommended the use of important reliability measures such as intra-coder agreement; internal consistency of responses and stability of scores across testing sessions before the researcher can generalise any results to the general population.
       In summary, from a review of the literature, conflicting findings show a clear need for further research in the production of discourse in the language impaired learning disabled population. Furthermore, strict reliability measures are required. This study has two aims. Firstly to assess and describe the narrative abilities of older language impaired learning disabled children and secondly to employ strict reliability measures.


METHOD
AIMS
PRIMARY AIMS
       The primary aims of this study are:1) to describe the narrative discourse productions of older language impaired learning disabled children; 2) to employ stringent measures of reliability.

SPECIFIC AIMS
       The specific aims of this study are:1) to analyse the language impaired learning disabled narrative discourse productions on a macrostructure level, using the measures of coherence and  cohesion; 2) to determine inter and intra-coder agreement and stability of scores across testing sessions.

RESEARCH DESIGN
       A parallel case study design was employed in this study as it was considered the most appropriate method of research in the learning disabled population due to the heterogeneity of this population. The heterogeneous nature of the learning disabled population has been recognised for many years (Wiig and Semel, 1986; Morris -Friehe and Sanger, 1992; Nelson, 1993).

SUBJECTS
Sample size
       Three language impaired learning disabled children (LILD 1, LILD 2 and LILD 3) between the ages of 11-13 years, were assessed in this study.
       Three non- learning disabled persons with no history of learning disabilities or any other neurological or behavioural disorders were included in this study as control subjects(C1, C2 and C3). These control subjects were matched for age and sex. The control subjects were not required to match for grade as the LILD children have repeated grades at school. The inclusion of these non-learning disabled subjects allows for the validation of the tasks and the obtained results.    

Subject selection criteria
       A number of criteria were applied in the process of selecting the three subjects. The subjects were required to have:1) been diagnosed as language impaired learning disabled by a speech and language pathologist and other professionals ;2) to be attending a school for the learning disabled ;3) to be impaired in either the receptive and/or expressive areas of language ; 4) been diagnosed as learning disabled not attributed to cultural differences; 5) have an average or above average non-verbal I.Q., ranging from 85 onwards, as determined by a formal intellectual ability test; 6) the subjects were required to be in the 11.0 to 13.0 year range. This choice of age group was influence by developmental aspects of narrative discourse. Applebee (1978) stated that children over the age of six years are able to produce an ideal or adult like narrative structure, although development continues over the age of ten years; 7) to be first language English speaking.

Subject Description
       The biographical and clinical information of the three LILD subjects is presented in Table 1 below.

Table 1: Biological and clinical information of the LILD subjects.

    LILD1    LILD2    LILD3
Chronological Age    12.8 years    11.0 years    12.9 years
Sex    male    male    female
Grade    6    5    4
No. of years at school    4    3    5
No. of yr. in therapy    > 5     > 6    > 5
Medication    Ritalin    Ritalin    No medication
I.Q    Above average    Above average    Above average
Current therapy    Speech , twice weekly    Speech , twice weekly    Speech, twice weekly
1st Language    English    English    English
Speech therapy reports    -    limited expressive language ability
-    limited auditory perceptual abilities
-    vocabulary age appropriate    -    limited expressive language
-    limited auditory perceptual abilities
-    word finding difficulty
-    vocabulary below average    -    poor receptive and expressive language abilities
-    limited auditory perceptual abilities
-    vocabulary below average
General education reports    -    poor written skills
-    received remedial lessons
-    poor arithmetic    -    poor written language skills
-    received remedial lessons
-    received occupational therapy for visual-motor integration    -    poor written skills
-    poor arithmetic


DATA COLLECTION AND INSTRUMENTATION
       The researcher met with each subject and control subject three times. In each session one story was elicited. A total of 18 narrative samples were collected, three from each subject. The procedure of collecting a narrative sample three times was undertaken for the following reasons:
       Firstly, to examine the stability of narrative productions across a number of testing sessions. Secondly, to determine whether more than one sample of the child’s narrative yielded more reliable results when analysing the appropriate elements. Thirdly and finally, three narrative samples were collected to increase the length of the narratives by combining the length of all three stories. Cole, Mills and Dale (1989) stated that multiple samples taken over a short period of time were more useful than one sample taken at a single point in time and presented with more lexical information than one sample. Furthermore, Liles (1993) stated that a story of substantial length ensures the narrative is representative of the child’s narrative ability.

STIMULUS MATERIAL
       The narrative samples were elicited using three wordless picture books: Story A :
“ Moonlight “ by Jan Ormerod ; Story B: “The Snowman “ by Ramond Briggs  & Story C: “The angel and the soldier boy “ by Peter Collington .

NARRATIVE LANGUAGE SAMPLE
       A story-retelling task was employed in this study for the following reasons: past research has shown that story retelling is much easier than story generation or creation (Ripich and Griffith, 1988). Story generation requires a person to formulate a story without a model, which can lead to problems in the LILD where their ability to organise events on their own and construct cohesive narrative language may be reduced (Ripich and Griffith, 1988). Story retelling also provides information regarding the more salient features of narrative such as memory for structure and cohesive devices (Ripich and Griffith, 1988).
       An advantage of story retelling is that it allows the child to tell a story that is more complete and has sufficient length (Strong and Shaver, 1991). Merritt and Liles (1989) state that story retelling was found to be more useful as the stories contained more grammatical components and more complete episodes for both LILD and normal language subjects. They also found that story retelling allowed for easier transcription and were more reliably scored than story generation.

LISTENER FAMILIARITY
        In this study the naïve listener condition was adopted. Research has found that when the listener adopts a naïve listener role the narrator will produce a higher number of cohesive devices especially personal references (Liles, 1993). In addition, Purcell and Liles (1992) state that the naïve listener condition also results in longer narratives and more coherent productions than a non-naïve listener condition.

TREATMENT OF DATA
Transcription of the narrative sample
       The researcher transcribed the videotaped discourse samples. The transcribed data was prepared for analysis by bracketing all false starts, repetitions and unintelligible utterances. The bracketed words were not included in the final word count but counted as a separate unit. Rules for counting the number of words were adopted from Strong and Shaver’s (1991) rules for counting words. The transcribed data was segmented according to an “event”.  For the purpose of this study an event was defined as a change in place, character or action.


Methods of analysis
       The narrative samples obtained in this study were analysed according to macrolinguistic organisation. Coherence and cohesion were taken as the measures of suprasentential (macrolinguistic) organisation (Glosser and Deser, 1990; Ulatowska, Freedman –Stern ,Doyel, Macaluso-Haynes & North ,1983).

Coherence
       The narrative samples obtained in this study were analysed according to Labov’s (1977) narrative structure elements that constitute a well –formed narrative:
•    Abstract: This refers to the one or two clauses summarising the whole story.                                                                                                                            •    Orientation: At the beginning of the narrative the time, place, persons and their activity or the situation is defined.                                                                                       •    Complicating action: This is the sequence of events, which is presented chronologically.
•    Evaluation: Various elements are used to express the narrator’s feelings about the characters or events.                                                                                                         •    Resolution: These are one or more statement, which reflect the final events or end the experience.
•    Coda: These are the free clauses that indicate the narrative is finished.

Evaluation
       Evaluation functions in order to specify why the narrative is being told and what the point of the narrative is (Labov, 1977). This is achieved through the narrator’s comments. Evaluation is also a means by which the narrator can attribute emotion to any character or event (Liles, 1993).
       Labov (1977) proposed that evaluation consists of four major categories: intensifiers, comparators, correlatives and explication. Each category consists of a number of subtypes. See Labov (1977) for full description of subtypes.

Cohesion
       According to Halliday and Hasan (1976) cohesion refers to the relations of meaning that exits within the text and define it as a text. Cohesion is expressed through grammar and vocabulary, which is referred to as grammatical cohesion and lexical cohesion, respectively. Grammatical cohesion consists of different types of ties namely: reference ties, substitution ties and ellipsis ties, while conjunction ties are mainly grammatical but contain a lexical component. Lexical ties are another important tie in cohesion (Halliday and Hasan, 1976).
Cohesive Ties
       This refers to the percentages of reference ties, substitution ties, ellipsis ties and conjunction ties and lexical ties used in the text (Halliday and Hasan, 1976).The percentage of each tie was calculated for each narrative sample and statistically analysed. The types of cohesive ties analysed in this study (as described by Halliday and Hasan, 1976) were as follows:
•    Anaphoric reference. This is defined as pronouns that refer to previously identified nouns. For example: …Suzie was at home playing with her dolls.
•    Demonstrative reference: This refers to the use of terms ‘this, that, here and there’. For example: …there was a girl and her mother and her father.
•    Ellipsis: This allows the speaker to reduce redundancy in a message by only encoding the essential elements. For example: …she went back.
•    Substitution: This refers to items other than personal pronouns that replace previously identified elements. For example:..she likes big boats because the little one was small.
•    Conjunctions: These serve a cohesive function as they relate successive utterances to each other. For example:.. they were quite happy because it was quite big.
•    Lexical ties: A lexical item refers back to another lexical item, and is related by having a common referent. For example: …he steals some fruit for the nose and some berries for the eyes. The items fruit and berries are a lexical tie as well as nose and eyes.
Cohesive adequacy
       Cohesive adequacy refers to the percentages of complete, incomplete, and erroneous ties (Liles, 1985). The definitions adopted for this study for complete, incomplete and erroneous ties were as follows (Liles et al 1995):
•    Complete tie: A complete tie is determined when the cohesive item that the tie refers to is easily identifiable and can be defined with no ambiguity.
•    Incomplete tie: A tie is incomplete if the item referred to by the cohesive marker is not given in the text.
•    Erroneous tie: A tie is erroneous if the cohesive marker refers to an ambiguous or erroneous item.

CONTENT AND CLARITY RATINGS
       To supplement the objective analysis of this study, a rating system was devised to evaluate the “content “ and “clarity” ratings of the subjects’ narrative discourse samples.
       This rating system also allows the researcher to compare the objective analysis with the subjective evaluation of the listener (Ulatowska et al. 1983). Ulatowska et al. (1983) suggested that  “content” could be referred to as a rough measure of coherence, and “clarity” as a rough estimate of cohesion. The rating system used in this study was adapted from Ogilvy (1995) and Ulatowska et al.
(1983).
      The content of the narrative was rated by three raters using a three- point scale as a means of answering specific questions. Five questions were used in this section.  Examples include “Do you know what is happening in this story?” and “ Does the sequence of events make sense?” The clarity of the narrative was also rated using a three-point scale. Three questions were used in this section. An example of a question , “ Is it clear to whom the narrator is referring to throughout the story?”

RESULTS AND DISCUSSION
       The findings of this study involve two parts. Firstly the results of the reliability measures and the implications thereof, will be presented. This will be followed by the presentation of the results of the analysis of the LILD and control subjects’ discourse productions.

MEASURES OF RELIABILITY
       Three raters, who are qualified speech and language pathologists, were involved in this study. Each rater received training regarding the methods of analysis.
The subjective content and clarity ratings were rated buy adults who were not qualified speech and language pathologists as formal training was not required.

Interrater reliability
       In this study, inter-observer agreement was determined. This served to assess the extent to which different observers or raters agree that they ‘see’ the same phenomena ( Hux et al. , 1997). According to Hux et al. (1997) inter-observer agreement consists of a number of assumptions. Firstly, the raters must share an understanding of what trait is being rated. Secondly, the raters must be able to determine the occurrence or non- occurrence of what is being measures. Finally, the raters must have a common means of recording the occurrence of the targeted behaviour or trait. Pearson product –moment correlation coefficients were calculated to determine transcription and score reliability.
Transcription reliability
       Transcription accuracy was determined by word- by- word reliability. This kind of transcription agreement is most commonly used in speech pathology research using transcriptions as a means of analysing data. The transcription agreement indices are generally calculated by a word by word agreement procedure. 20% of each sample was randomly selected by two transcribers who  independently transcribed each selected sample of narrative productions (Strong and Shaver, 1991). Word- by -word agreement resulted in percentage agreement ranging from 95% to 98%. Correlation coefficients for interrater transcriptions were greater than .99 for all additional transcriptions, indicating high transcription reliability.
Coder reliability
       An additional coder was trained to analyse a randomly selected 20% of each narrative sample (Strong and Shaver, 1991). The samples were randomly selected by the coder. The coder was required to analyse coherence and cohesion.
       This allowed the researcher to determine whether the method of analysis used was consistent across all narrative samples. The coder was trained for approximately 4 to 6 hours before interrater coding began. The coder randomly selected 20% of each sample and coded each transcription independently. Point -by -point agreement was calculated and an agreement percentage of 95% to 98% for intercoder reliability was obtained. A correlation coefficient of .99 was obtained indicating high intercoder reliability.

Intrarater reliability
Transcription reliability
       15% of each narrative sample was randomly selected for transcribing a second time by the researcher. Word -by -word reliability measures resulted in intrarater reliability for transcription averaging at 97%. A correlation coefficient of .996 was obtained for intrarater transcription reliability.
Coder reliability
       15% of each narrative sample was randomly selected and scored a second time by the researcher. Word –by- word reliability measures resulted in an intrarater reliability for coding averaging at 98%.  The correlation coefficient for intracoder reliability was .96, indicating a high reliability.

Stability coefficients
       The main function of the reliability measures presented in this research was to determine the relationship among scores obtained across the three testing times i.e. to determine whether the measurement was consistent over the testing sessions.

       According to Hux et al. (1997) and Strong and Shaver (1991) calculating correlation coefficients is a common way of assessing the reliability of scores in discourse analysis. For the purpose of this study Pearson product –moment correlation coefficients and an Analysis of Variance (ANOVA) were computed to examine the reliability of the obtained scores. An alpha level of .05 was used for all statistical levels. Correlation coefficients were computed across the three stories to establish the stability of scores of evaluation, cohesive ties and cohesive adequacy across the testing sessions. As the stories were administered with identical instructions, they were considered suitable to be used as parallel forms for the calculation of the coefficients of stability.
Stability coefficients for evaluation
       Analysis of variance (ANOVA) was computed for each subcategory of evaluation. These results are presented in Table 2. No significant scores were obtained indicating there was no fluctuation of these scores across the three testing sessions. Therefore, it can be stated that there was stability across all testing sessions for evaluative elements.

Table 2: ANOVA probability values.
Category    Evaluative element    P values
Intensifiers    Quantifier    .28
    Repetition    .47
    Ritual Utterance    .95
    Prolonged vowel    .53
Comparators    Or clause    .18
    Negative    .77
    Future    -
    Modals    .08
    Quasimodal    .84
    Superlative    .39
    Comparative    .36
    Question    .6
    Imperative    .23

Correlators    Progessives    .2
    Appended participle    .53
    Double apposative    .59
    Double attributive    .94
Explicatives        .65
Total evaluation        .55
Significant score ( p<0,05; n=18)

Stability coefficients for cohesion
Cohesive ties
       Pearson product -moment correlation coefficients were computed to determine the stability of cohesive ties and cohesive adequacy across the three testing sessions. The results are presented in Table 3. The results indicate that the values for demonstrative reference, substitution, conjunction and lexical ties were not significantly different and therefore were stable across the three testing sessions. Values for anaphoric reference and ellipsis were significant and therefore were not found to be stable across testing sessions.

Table 3: Correlation coefficients for cohesive ties.

Cohesive tie    Correlation coefficient
Anaphoric reference    .91*
Demonstrative reference    .19
Ellipsis    .73*
Substitution    .10
Conjunction    .33
Lexical     .35
* significant value (r>0,46 ;p<0,05)

Cohesive adequacy
       Statistical analysis of cohesive adequacy revealed significant correlation for most of the anaphoric reference and lexical ties.
       This shows that these scores were not stable across testing sessions. No significant scores were obtained for demonstrative reference, ellipsis, substitution or conjunctions, therefore indicating that the obtained cohesive adequacy scores, except for anaphoric reference and lexical ties, were stable across the three testing times.  Strong and Shaver (1991) found similar results, as only 5 out of 12 variables tested for stability were significant, and therefore not stable across the testing sessions. The results are presented in Table 4 below.

Table 4:Correlation coefficients for cohesive adequacy.

    Story A vs B    Story B vs C    Story A vs C
Anaphoric reference    complete    -.675*    .583*    -.704*
    incomplete    -.424    -.343    .571*
    erroneous    .162    .957*    .117
Demonstrative reference    complete    .60    -.20    -.39
    incomplete    .00    .00    .00
    erroneous    .00    .00    .00
Substitution    complete    .44    .00    .00
    incomplete    .00    .00    .00
    erroneous    .00    .00    .00
Ellipsis    complete    .00    .00    .00
    incomplete    .00    .00    .00
    erroneous    .00    .00    .00
Conjunction    complete    .00    .00    .00
    incomplete    .00    .00    .00
    erroneous    .00    .00    .00
Lexical tie    complete    -.36    .46    -.47*
    incomplete    -.29    .26    .39
    erroneous    .28    .50*    .92*
* Significant value = (r>0,46 ; p< 0,05)

        In summary, stability coefficients were calculated for evaluation, cohesive ties and cohesive adequacy. The results indicated that only scores for evaluation were completely stable across time and can be considered reliable. As values for anaphoric reference and ellipsis in the analysis of stability of cohesive ties were found to be significant, these can be regarded as being unstable.  
       Similarly in the analysis of stability for cohesive adequacy, anaphoric reference and lexical ties were shown not to be stable across the three testing sessions, therefore these above mentioned scores should be interpreted with due caution as they are not completely reliable.
       
       The findings of the analysis of the LILD and control subjects discourse productions are now presented below.

NARRATIVE DISCOURSE CHARACTERISTICS OF THE LILD AND CONTROL SUBJECTS
Number of events
       The basic unit of ‘event’ was used to segment the transcribed data. The term ‘event ‘ can be argued to indicate the amount of information a child produces in a narrative. Variation across the stories and variation among the subjects regarding the amount of information they produced was found. There were no marked differences between the LILD and control subjects. This finding supports that of Ripich and Griffith (1988), who found that children with learning disabilities did as well as non-learning disabled children, with regard to the amount of information given.

Length of narrative
       The total number of words in each narrative was used as a measure of length of narrative. Repetitions judged as intensifiers were included in the final word count. A wide variation regarding the length of the narrative discourse productions was noted amongst the subjects. The LILD subjects did not appear to present with very short or lengthy stories when compared to the control subjects.
       These findings support research conducted by Culatta, Page and Ellis (1983) who found variations in the number of words of narrative productions in LILD and normal subjects.

Temporal organisation of events
       The clauses in normal narrative productions are typically ordered in temporal sequence (Ulatowska et al.1983). All the subjects’ narratives showed a general preservation of the temporal or chronological sequence of events. Similarly, Ripich and Griffith (1988) found that LILD children are able to organise their narratives according to the structure of the story used for elicitation. LILD3 presented with a confused temporal sequence of events in story C. This could be explained by the length of the story, which consisted of more events and was more complex in structure. The issue of memory in LILD children arises, when examining the problematic temporal sequences in LILD3’s narratives (Isaki and Plante, 1997). Ripich and Griffith (1988) found that as the stories became more complex, the subjects produced more errors in their narratives.
 
Coherence
        Objective analysis of coherence revealed that all subjects, both LILD and control subjects, had the essential elements in their narrative discourse production. These being orientation, complicating action, resolution and evaluation. The findings regarding the organisation of the story grammar concur with those reported by Graybeal (1981 in Ripich and Griffith, 1988) who state that children with learning disabilities are able to organise their stories according to a story grammar.

Evaluation
        A subtle difference was found between the LILD and control subjects regarding the amount of evaluation used, however a marked difference was noted in the types of evaluative elements used across the subjects. The LILD subjects used fewer types of categories and did not include prolonged vowel, question, and superlatives, comparatives, double apposatives and explicatives into their narrative productions. The use of fewer repetitions and negatives was also noted in the LILD discourse productions. Furthermore, they were noted to use more ritual utterances and intensifiers than the control subjects. The control subjects were noted to use these categories in their narrative discourse productions. This finding could be attributed to the greater complexity of these types of elements.

Cohesion
Cohesive ties
       Objective analysis showed that all the subjects did not use all categories of cohesion. Anaphoric reference and lexical ties were the categories used the most by all the subjects. This finding supports Mentis and Prutting (1987) who found that referential and lexical types of cohesion were the most commonly used type in normal narrative productions. All subjects showed developmentally appropriate usage of conjunctions.
        Demonstrative reference and ellipsis were used, but fewer scores were noted across all the subjects’ narratives. Substitution was the least used category, and was only used by C2, C3 and LILD1. Ellipsis and substitution were used less frequently than the other categories such as reference and conjunctions. This finding supports research documented by Rumble and Malan (1990).
Cohesive adequacy
       No marked differences were noted amongst the LILD subjects and the control subjects regarding the cohesive adequacy of their narrative productions. In addition, as noted previously, measures of reliability indicated that these scores were unstable across testing sessions and need to be considered with caution. However, it was of interest that the results from the analysis of the narrative productions revealed erroneous use of anaphoric referencing of LILD2, ellipsis by LILD1 and the lexical ties by LILD3. These erroneous errors may account for the lower clarity ratings obtained by the LILD subjects, This is discussed in more detail below.

CONTENT AND CLARITY RATING
       Content and clarity ratings were developed to compare the results of the objective analysis with a subjective analysis regarding the content and clarity of each subjects’ narrative sample of story B.  In order to determine the interrater reliability for the content and clarity ratings, Pearson product -moment correlation coefficients were computed. A significant value was found between rater 1 and rater 2 and between rater 2 and 3 for cohesion. Therefore these raters presented with high correlation values and these ratings can be considered as being reliable. Most of the subjects received mixed ratings for content except for C1, whom received “high” ratings.
       This could be explained as C1 produced the most accurate narrative according to the model, including all or most of the appropriate information. The other subjects were reported by the raters, to omit or include information, which was not part of the model story.
       The LILD subjects received lower ratings for clarity than the control group. This means that the raters judged the LILD group to produce narratives without the appropriate use of language, referencing and conjunctions. Objective analysis of the LILD narratives regarding cohesion concurs with the clarity ratings as the LILD group tended to use a higher number of incomplete and erroneous cohesive ties.
       LILD2 was shown to use the least percentage of complete anaphoric ties, and was judged to have the lowest rating of cohesion in his narrative of story B. LILD3 produced the least amount of complete lexical ties in story B. This could explain why LILD3 received a lower rating of cohesion than LILD1 and the control group.

SUMMARY OF THE FINDINGS
       A summary of the findings of the subjective and objective analysis are presented below:
•    Amount of information: No difference was noted between the LILD and control subjects.
•    Length of the narrative discourse productions: No variations noted regarding length of narratives produced.
•    Temporal sequence: All subjects showed general preservation of temporal sequence.
•    Coherence: According to the objective and subjective analysis of coherence the LILD and control subjects generally used the necessary items needed for a well –formed narrative.
•    Evaluation: All subjects used necessary elements of evaluation although subtle variations were noted between the LILD and control subjects. Furthermore, the LILD subjects tended to use less complex elements and increased ritual utterances and intensifiers than the control subjects.
•    Cohesion: No marked differences were noted amongst the subjects for cohesive ties. Analysis of cohesive adequacy indicated that there could be difficulties with anaphoric referencing and lexical ties in the LILD subjects’ narrative productions. These subtle deficits noted across all the LILD subjects in the objective analysis were supported by the lower subject ratings received by all three LILD subjects on the clarity ratings.

CONCLUSION AND IMPLICATIONS
       The use of reliable measures in the analysis of narrative discourse productions, in this study, allowed for findings to be adopted with greater confidence. Stability coefficients enable the researcher to know that the scores are representative of the samples produced by the subjects. Results that are not stable across testing sessions need to be considered with caution until further investigation can clarify the reliability of the scores being analysed.
       One of the most important clinical findings of this study was that the LILD subjects were able to tell stories. This supports previous research, which states that LILD children contain the basic ability to produce stories ( Roth and Spekman, 1986, Morris -Friehe and Sanger, 1992).
       These findings further support Liles et al. (1995) as they state that LILD children appear to be using, or attempting to use information typical of a normal developing child regarding the production of narrative discourse.
       

       At times, objective analysis of the coherence and cohesion of all the subjects’ narrative productions did not present with marked differences but rather indicated subtle differences. These findings support past studies which have shown that differences between LILD subjects and control subjects are small and insignificant or none at all (Ripich and Griffith,1988).
       Furthermore, the heterogeneous nature of the learning disabled population has often been highlighted in past literature. The variations found within learning disabled children often overlap with the variations found within the non-learning disabled population  (Morris-Friehe and Sanger, 1992, Nelson, 1993).
       Possible factors thought to have contributed to the findings presented in this study include the nature and complexity of the task employed. The material used in this study could have been too simple, with too few events or lacking the appropriate complexity to tap any existing linguistic deficits. It is believed that other additional discourse tasks and perhaps more complex tasks are required in order to tap the more subtle language deficits found in adolescent language impaired learning disabled populations.
      Another variable that could have influenced the findings in this study is the amount of speech and language therapy the subjects received prior to the study. Ulatowska, Hill, Thompson ,Parsons and Wertz (1998) stated that this could play an important role in this area of narrative discourse research.
       Finally, the amount of variation in the normal controls needs to be highlighted. The results presented in this study not only illustrated the subtle differences in the LILD subjects but also quite noticeably highlighted the individual variations found within the control subjects.
       The differences amongst the LILD and control subjects were not always marked and sometimes the differences related merely to a matter of degree.  Sonnenberg and Penn (1998) state that there is a wide variability across discourse measures that illustrate the features of normal subjects’ discourse that are found to overlap with those of the clinical subjects.
     Sonnenberg and Penn (1998) provided a quote that captures the morality and issues involved in the research of narrative discourse:

       Whose culture, experience, and value system are we to follow in making judgements of propriety? On what basis are we to say that a narration is too much or too little? One person’s embellishment is another person’s ingenuity. One person’s digression is another person’s interesting trip.
                                                                       (Davis, 1993 in Sonnenberg and Penn, 1998)                

FUTURE RESEARCH IMPLICATIONS
       Numerous future research implications have emerged from this study: A need to continue to establish valid and reliable indices of measurement for narrative ability has been highlighted. In addition, further research regarding the normal variation in narrative discourse productions in the non- learning disabled population is needed.
       Without this, interpretation of the analysis of the narrative discourse productions of language impaired population is limited and the diagnosis of language impairments from the analysis of narrative discourse remains questionable.
       As stated previously, language impairments in the learning disabled, become increasingly subtle and harder to identify with age. Thus, little information is available on narrative discourse in adolescent language impaired persons.
       Hence, further investigation in the language impaired learning disabled adolescents would enhance the treatment and rehabilitation of such a disorder. Other aspects, which could be considered for further investigation are the comparison of spoken and written discourse samples in the LILD population and the measurement of discourse across different elicitation tasks. Measurement of discourse ability across various levels of complexity could yield interesting findings and may even provide a means to tap the more subtle deficits noted in older LILD children’s narratives.

REFERENCES
Applebee (1978). The Child’s Concept of Story: Ages Two to Seventeen. The University Chicago Press, Chicago.
Briggs, R. (1978). The Snowman.  Penguin Group, London.
Bruner, J. (1990). Acts of meaning. Cambridge: Harvard University.
Culatta, B., Page, J. & Ellis, J. (1983). Story retelling as a communicative performance screening tool. Language, Speech and Hearing Services in Schools, 14, 66-74.
Cole, K., Mills, P. & Dale, P. (1989). Examination of test-retest and split-half reliability measures derived from language samples of young handicapped children. Language, Speech and Hearing Services in Schools, 20, 259-268.
Collington, P. (1987). The Angel and the Soldier Boy. Methuesen Children, London.
Glosser, G. and Deser, T. (1990). Patterns of discourse production among
    neurological patients with fluent language disorders. Brain and Language, 40,
    67-88.

Halliday, M. and Hasan, R. (1976). Cohesion in English. Longman Group Ltd.,
    London.
Hux, K., Sanger, D., Ried, R. & Maschka, A. (1997). Discourse analysis procedures: reliability issues. Journal of Communicative Disorders, 30,133-150.
 Isaki, E. and Plante, E. (1997). Short-term and working memory differences in
     language/learning disabled and normal adults. Journal of Communication  
     Disorders, 30, 427-437.
Klein, H. (1991). Children with specific learning difficulties: the role of the speech therapist. In Snowling, M.(Ed.) Children’s  Written Language Difficulties. NFER- Nelson Publishing Company Ltd., Oxford.
Labov, W. (1977). Language in the Inner City: Studies in the Black English
     Vernacular. Basil Blackwell, Oxford.
Liles, B. (1985).Cohesion in the narratives of normal and language -disordered
     children. Journal of Speech and Hearing Research, 28, 123-133.
Liles, B. (1993). Narrative discourse in children with language disorders and   
     children with normal language : a critical review of the literature. Journal of  
     Speech and Hearing Research, 36, 868-882.
Liles, B., Duffy, R., Merritt, D. & Purcell, S. (1995). Measurement of narrative
     discourse ability in children with language disorders. Journal of Speech and
      Hearing Research, 38, 415-425.
Mentis, M. and Prutting, C. (1987). Cohesion in the discourse of normal and   
     head- injured adults. Journal of Speech and Hearing Research, 30, 88-98.
Merritt, D. and Liles, B. (1989). Narrative analysis: clinical applications of story generation and story retelling. Journal of Speech and Hearing Disorders, 54, 429-438.
Morris -Friehe, M. and Sanger, D. (1992). Language samples using three story elicitation tasks and maturation effects. Journal of Communication Disorders, 25, 107-124.
Nelson, N. (1993). Childhood Language Disorders in Context: Infancy through adolescence. MacMillan Publishing Co., U.S.A.
Ogilvy, D. (1995). Assessment of the Linguistic and Cognitive Abilities in the Learning Disabled Deaf Population. Published Doctorate of Arts Thesis, University of the Witwatersrand, Johannesburg.
Ormerod, J. (1982). Moonlight. Penguin Group, London.
Paul, R. and Smith, R.(1993). Narrative skills in 4-year-olds with normal, impaired and late-developing language. Journal of Speech and Hearing Research, 36,592-598.
Purcell,S. and Liles, B. (1992). Cohesion repairs in the narratives of normal -     
     language and language -disordered school-age children. Journal of Speech and
      Hearing Research, 3, 354-362.
Ripich , D. and Griffith, P. (1988). Narrative abilities of children with learning disabilities and non-disabled children: story structure, cohesion and propositions. Journal of Learning Disabilities, 21(3),165-173.
Roth, F. and Spekman, N. (1986). Narrative discourse: spontaneously generate
      stories of learning –disabled and normally achieving students. Journal of
      Speech and Hearing Disorders, 51, 8 –23.
Rumble, S. and Malan, K. (1990). The development of a framework for assessing developing conversational skills. The South African Journal of Communication Disorders, 37,21-26.
Strong, C. and Shaver, J. (1991). Stability of cohesion in the spoken narrative of language impaired and normally developing school-aged children. Journal of Speech and Hearing Research, 34, 95 –11.
Spinelli, F. and Ripich, D. (1991.) Discourse and education. In Ripich, D. and Spinelli, F. (Eds.). School Discourse Problems. Singular Publishing Group Inc. San Diego.
Sonnenberg, B. and Penn, C. (1998). What is Abnormal Discourse? Some Cautions for the Aphasia Therapist.  Poster presentation at the 8th International Aphasia Rehabilitation Conference, Kwa Maritane, Pilanesberg National Park, South Africa.
Ulatowska, H., Freedman –Stern, R. ,Doyel, A., Macaluso-Haynes, S. & North, A. (1983). Production of narrative discourse in aphasia, Brain and Language, 19, 317-334.
Ulatowska, H., Hill, C. ,Thompson, J., Parsons, S. & Wertz, R. (1998). Discourse Productions in African –Americans with Aphasia. Presentation at the 8th International Aphasia Rehabilitation Conference, Kwa Maritane, Pilanesberg National Park, South Africa.
Wiig, E. and Semel, E. (1986). Language Disabilities in Children and Adolescents.      
     Charles E. Merrill Publishing Company, Ohio.