Tok Pisin

Tok Pisin is a creole language spoken throughout Papua New Guinea. It is an official language of Papua New Guinea and the most widely used language in the country, with between five and six million speakers. Tok Pisin developed as a trade pidgin and has been referred to as "New Guinea Pidgin" or "Pidgin English". Urban dwellers in particular often communicate in Tok Pisin, and perhaps one million people now use it as a primary language.

The Tok Pisin Project is an initiative funded by the Department of Defence, Science and Technology (DST), which wants the CoEDL Tok Pisin corpus to build up transcribed materials covering around 15 hours of recorded naturalistic, conversational speech. About 11 hours are currently held in PARADISEC, and another 4 hours or so from other sources still need to be transcribed. At present no transcriptions exist, and DST initially wants a spoken corpus as a subset of the overall Tok Pisin corpus. The intended outcome of the project is a set of faithful transcriptions of Tok Pisin as it is actually used, which can be fed into a machine learning system that matches the sounds in the recordings against the phonemic transcriptions.


