Universiti Teknologi Malaysia Institutional Repository

The potential contribution of general and specialized corpora to research on Malay and Malaysian English.

Mohd. Don, Zuraidah and Knowles, Gerry (2022) The potential contribution of general and specialized corpora to research on Malay and Malaysian English. LSP International Journal, 9 (2). pp. 85-96. ISSN 2601-002X

Official URL: http://dx.doi.org/10.11113/lspi.v9.19469

Abstract

Today’s linguists are increasingly concerned with high-level properties of texts and tend to work top-down in some branch of discourse analysis, while corpus linguists are concerned with low-level properties such as grammatical class, syntactic constructions and different kinds of text annotation, and tend to work bottom-up. This paper seeks to close the gap, using a general corpus and a specialised corpus. The point of departure is the assumption that a corpus is compiled to study the language of texts for some special purpose beyond the existence of the corpus itself. The particular languages in mind are Malay and Malaysian English. The introduction deals with matters that have to be considered when a corpus project is planned, and with the problems that can arise, some of which have been reported. The methodology section concentrates on the groundwork that has to be done for just about any corpus-based project; it starts with a project undertaken long before computers were invented and goes on to describe the role of computational expertise in modern corpus-based projects. The results section reports some preliminary work on a specialised corpus containing the speeches of Tun Mahathir Mohamad, which attempts to go beyond the groundwork to ascertain objectively what the speeches are about. The paper ends with a combined discussion and conclusion that summarises its content.

Item Type: Article
Uncontrolled Keywords: Malay, Malaysian English, Corpora, Specialised Corpora, Empirical Methodology, Frequency Word Lists
Subjects: L Education > L Education (General)
Divisions: Language Academy
ID Code: 104505
Deposited By: Muhamad Idham Sulong
Deposited On: 08 Feb 2024 08:16
Last Modified: 08 Feb 2024 08:16
