The bibliography contains publications and PhD theses directly related to the use of corpora and corpus linguistic techniques in discourse studies – or, seen from a different perspective, corpus studies focusing on discourse issues. The approach may be termed ‘corpus-based’, ‘corpus-driven’ or ‘corpus-assisted’ (to the extent that these distinctions are useful or, indeed, meaningful), and the analysis itself can be ‘critical’ or otherwise (again, the same reservations apply). There are no restrictions regarding language. Please note that studies focusing on discourse or lexicogrammatical features are relevant here only if they explicitly discuss how these features contribute to discourse meaning or style, or to the creation of particular discourses.

Whatever the label attached to their approach, publications are relevant when they …

  • report on research using corpora and corpus techniques to examine discourse meaning/style and/or particular discourses, or
  • discuss the compilation of corpora, or the development of corpus metrics/techniques, for discourse studies.

The publications are ordered chronologically, starting with a list of introductions and overviews. I opted for a chronological order, because it helps provide a sketch of beginnings and developments in terms of the focus and methodology of discourse-oriented corpus studies and the volume of publications. For example, it can be seen that attention to discourse features dates back to the beginnings of corpus linguistics. Also, the beginning of using corpus techniques in ‘critical discourse studies’ can be traced back toLeech & Fallon (1992), which also seems to be the first study to use an exploratory keyness analysis — four years before the terms keyness and keyness analysis were introduced.

The DOCS Bibliography (in PDF) is available here:


Gabrielatos, C. (2021, November 26) Bibliography of Discourse-Oriented Corpus Studies. Retrieved from [URL].

Contributing to the DOCS bibliography

I would be grateful for any suggestions regarding additions or corrections. I am particularly interested in studies published in the 1980s and early 1990s, or the last few years. If any of the publications are open-access (or have open-access versions) please send me the URL. If you are sending details of your own work, please provide formatted citations. I would also be happy to link your name to your webpage, blog, or twitter.

Please send updates, correction, or suggestions to [email protected], using ‘DOCS bibliography’ as the email subject.

The following colleagues have contributed so far:

Katja Seljak Adimora, David Ayers, Monika Bednarek, Georgia Carr, Isabella Chiari, Jersus Colmenares, Eniko Csomay, Mark Dang-Anh, Kim Ebensgaard, Irene Elmerot, Bacem Abdalah Essam, Saira Fitzgerald, Sharon Glaas, Daniel Ginsberg, Kat Gupta, Sally Hunt, Ersilia Incelli, Vigdis Jensen, Joanna Kopaczyk, Ramesh Krishnamurthy, Chinger Enrique Zapata Leal, Alon Lischinsky, Stef Maci, Katrin Menzel, Jim Mischler, Alan Partington, Robert Poole, Alison Sealey, Bazil Solomon, Stefania Spina, Nic Subtirelu, Arnaud Vincent.

Special thanks to Charlotte Taylor for allowing me to incorporate entries from her bibliography of corpus-based work on Language and Gender.

The following Edge Hill University GTAs have assisted with updates. I am grateful toProf. Alyson Brown for allocating funds to this purpose.

  • August – December 2021: Katia Adimora
  • March 2020 – January 2021: Dan Malone
  • March 2015 – May 2016: Laura Burnham


If you have made use of the DOCS bibliography in your research or teaching, or if you have referred to it in a publication, presentation, or blog (or if you have come across such mentions), please let me know ([email protected]).