\PassOptionsToPackage{unicode=true}{hyperref} % options for packages loaded elsewhere
\PassOptionsToPackage{hyphens}{url}
%
\documentclass[]{article}
\usepackage{lmodern}
\usepackage{amssymb,amsmath}
\usepackage{ifxetex,ifluatex}
\usepackage{fixltx2e} % provides \textsubscript
\ifnum 0\ifxetex 1\fi\ifluatex 1\fi=0 % if pdftex
  \usepackage[T1]{fontenc}
  \usepackage[utf8]{inputenc}
  \usepackage{textcomp} % provides euro and other symbols
\else % if luatex or xelatex
  \usepackage{unicode-math}
  \defaultfontfeatures{Ligatures=TeX,Scale=MatchLowercase}
\fi
% use upquote if available, for straight quotes in verbatim environments
\IfFileExists{upquote.sty}{\usepackage{upquote}}{}
% use microtype if available
\IfFileExists{microtype.sty}{%
\usepackage[]{microtype}
\UseMicrotypeSet[protrusion]{basicmath} % disable protrusion for tt fonts
}{}
\IfFileExists{parskip.sty}{%
\usepackage{parskip}
}{% else
\setlength{\parindent}{0pt}
\setlength{\parskip}{6pt plus 2pt minus 1pt}
}
\usepackage{hyperref}
\hypersetup{
            pdftitle={CSci 658: Software Language Engineering Domain Specific Languages},
            pdfauthor={H. Conrad Cunningham},
            pdfborder={0 0 0},
            breaklinks=true}
\urlstyle{same}  % don't use monospace font for urls
\setlength{\emergencystretch}{3em}  % prevent overfull lines
\providecommand{\tightlist}{%
  \setlength{\itemsep}{0pt}\setlength{\parskip}{0pt}}
\setcounter{secnumdepth}{0}
% Redefines (sub)paragraphs to behave more like sections
\ifx\paragraph\undefined\else
\let\oldparagraph\paragraph
\renewcommand{\paragraph}[1]{\oldparagraph{#1}\mbox{}}
\fi
\ifx\subparagraph\undefined\else
\let\oldsubparagraph\subparagraph
\renewcommand{\subparagraph}[1]{\oldsubparagraph{#1}\mbox{}}
\fi

% set default figure placement to htbp
\makeatletter
\def\fps@figure{htbp}
\makeatother

\usepackage{caption}
\DeclareCaptionLabelFormat{nolabel}{}
\captionsetup{labelformat=nolabel}

\title{CSci 658: Software Language Engineering\\
Domain Specific Languages}
\author{\textbf{H. Conrad Cunningham}}
\date{\textbf{2 April 2018}}

\begin{document}
\maketitle

{
\setcounter{tocdepth}{4}
\tableofcontents
}
Copyright (C) 2016, 2017, 2018, \href{http://www.cs.olemiss.edu/~hcc}{H.
Conrad Cunningham}\\
Professor of \href{https://www.cs.olemiss.edu}{Computer and Information
Science}\\
\href{http://www.olemiss.edu}{University of Mississippi}\\
211 Weir Hall\\
P.O. Box 1848\\
University, MS 38677\\
(662) 915-5358

\textbf{Advisory}: The HTML version of this document requires use of a
browser that supports the display of MathML. A good choice as of April
2018 is a recent version of Firefox from Mozilla.

TODO:

\begin{itemize}
\tightlist
\item
  Finish missing sections
\item
  Structure the document better
\item
  Integrate example external and internal DSLs with this document with
  whatever language(s) are needed
\item
  Better integrate the various lists of guidelines for construction of
  DSLs. Better tie these to concrete examples
\item
  Consider monadic DSLs in Haskell, macro systems, Template Haskell,
  staged metaprogramming, etc.
\end{itemize}

\hypertarget{domain-specific-languages-dsls}{%
\section{Domain Specific Languages
(DSLs)}\label{domain-specific-languages-dsls}}

\hypertarget{what-are-dsls}{%
\subsection{What are DSLs?}\label{what-are-dsls}}

\hypertarget{motivation}{%
\subsubsection{Motivation}\label{motivation}}

Few computer science graduates will design and implement a
general-purpose programming language during their careers. However, many
graduates will design and implement---and all likely will
use---special-purpose languages in their work. These special-purpose
languages are often called
\href{http://en.wikipedia.org/wiki/Domain-specific_language}{domain-specific
languages} {[}Wikipedia{]}.

Paul Hudak describes a \emph{domain-specific language} (or \emph{DSL})
as ``a programming language tailored to a particular application
domain'' {[}\href{localcopy/HudakModularDSLTools.pdf}{Hudak 1998}{]},
that is, to a particular kind of problem.

\href{https://en.wikipedia.org/wiki/General-purpose_programming_language}{\emph{General-purpose
languages (GPLs)}}, such as Java, Python, C, and Haskell, seek to be
broadly applicable across many domains. They can, in theory, compute any
function that is computable by a finite procedure; they are said to be
\href{https://en.wikipedia.org/wiki/Turing_completeness}{\emph{Turing
complete}}.

DSLs might be Turing complete, but often they are not. DSLs are
\emph{little languages}
{[}\href{localcopy/BentleyLittleLanguages.pdf}{Bentley 1986}{]} that
``trade generality for expressiveness''
{[}\href{localcopy/WhenDSL.pdf}{Mernik 2005}{]}.

Ideally, a DSL should enable experts in an application area to program
without programming---that is, to express the problems they want the
computer to solve using familiar concepts and notations, without having
to master the intricacies of programming in a general-purpose language
{[}\href{localcopy/HudakModularDSLTools.pdf}{Hudak 1998},
\href{localcopy/DSLAnnotatedBib.pdf}{van Deursen 2000}{]}.

\hypertarget{examples}{%
\subsubsection{Examples}\label{examples}}

As discussed in Bentley's classic column on ``Little Languages''
{[}Bentley 1986{]}, the DSL
\href{https://en.wikipedia.org/wiki/Pic_language}{\texttt{pic}} enables
writers to produce line drawings in typeset documents; they can focus on
the layout of the drawings without being required to develop programs in
C, the primary general-purpose language used on Unix. (See Bentley's
column on little languages
{[}\href{localcopy/BentleyLittleLanguages.pdf}{Bentley 1986}{]} for
discussion of this DSL.)

The \texttt{pic} language and tool were built according to the
\href{https://en.wikipedia.org/wiki/Unix_philosophy}{Unix philosophy}.
This approach focuses on a minimalist and modular approach to software
development. The Unix tools have limited functionality but are usually
built to read and write streams of bytes so that tools can be readily
composed using Unix pipes.

Other DSLs on the Unix (or Linux) platform include:

\begin{itemize}
\item
  \href{https://en.wikipedia.org/wiki/AWK}{\texttt{awk}} for extracting
  data and generating reports from streams of text-based data using
  regular expressions to specify the processing
\item
  \href{https://en.wikipedia.org/wiki/Lex_(software)}{\texttt{lex}} for
  generating lexical analyzers
\item
  \href{https://en.wikipedia.org/wiki/Yacc}{\texttt{yacc}} for
  generating parsers
\item
  \href{https://en.wikipedia.org/wiki/Make_(software)}{\texttt{make}}
  for building software from its various sources
\item
  \href{https://en.wikipedia.org/wiki/C_preprocessor}{\texttt{cpp}} for
  preprocessing C programs to include header files, expand macros, and
  support conditional compilation
\item
  \href{http://www.graphviz.org/documentation/}{\texttt{dot}} little
  language for specifying graph structures within the
  \href{http://www.graphviz.org}{Graphviz} set of tools
\end{itemize}

Markup languages are also DSLs.

\begin{itemize}
\item
  \href{https://en.wikipedia.org/wiki/HTML}{HTML} is a DSL for
  formatting documents on the Web.
\item
  \href{https://en.wikipedia.org/wiki/LaTeX}{LaTeX} is a powerful markup
  language used for books and articles (especially in STEM disciplines).
\item
  \href{https://en.wikipedia.org/wiki/Markdown}{Markdown} is a simple
  markup language for documents that may be needed in various formats,
  especially HTML. It is often used by wikis and other websites with
  user-contributed content (e.g.~Wikipedia, GitHub). This document is
  written using the variant of Markdown supported by the
  \href{https://pandoc.org/}{Pandoc} tool.
\item
  \href{https://en.wikipedia.org/wiki/ReStructuredText}{reStructuredText}
  is a simple markup language used mostly in the Python community for
  technical documentation.
\end{itemize}

The designers of a DSL must select relevant concepts, notations, and
processes from the application domain and incorporate them into the DSL
design {[}Hudak 1998{]}.

\hypertarget{definition}{%
\subsubsection{Definition}\label{definition}}

Martin Fowler defines a \emph{domain-specific language} as a ``computer
programming language of limited expressiveness focused on a particular
domain'' {[}Fowler 2011{]}.

He explains these terms as follows:

\begin{itemize}
\item
  As a \textbf{language}, a DSL has \emph{fluency}. The expressiveness
  of the language comes not just from the simple expressions but also
  from how those expressions can be easily composed to form larger
  units.
\item
  As a \textbf{computer programming language}, a DSL is structured so
  that humans can effectively read and write ``programs'' in the
  language and computers can accurately read and ``execute'' the
  instructions. It has a well-defined syntax and semantics.
\item
  Having \textbf{limited expressiveness} means the DSL includes features
  that are needed for its purpose and excludes features that are not
  needed for that purpose.
\item
  Being \textbf{focused on a particular domain} means that the DSL has a
  purpose that is both clearly and narrowly defined.
\end{itemize}

\hypertarget{boundaries}{%
\subsubsection{Boundaries}\label{boundaries}}

What is and is not a DSL is a fuzzy concept. Not all writers agree on
the definition. In these notes, we use Martin Fowler's definition above
to consider whether or not a notation is a DSL.

Fowler suggests that one key characteristic of a DSL is its
\emph{language} nature {[}Fowler 2011{]}.

\begin{itemize}
\item
  The language (DSL) is intended to be read and written by humans as
  well as read by computers.
\item
  The DSL must be \emph{fluent}. That is, the meaning of the language
  comes not just from the individual expressions (i.e., the words) but
  also from how the expressions are composed together (i.e., into
  sentences and paragraphs). It has both a vocabulary and a grammar.
\end{itemize}

Fowler also suggests two other key characteristics of a DSL---its
\emph{limited expressiveness} and its \emph{domain focus} {[}Fowler
2011{]}.

He argues that a DSL should have only those features needed to support
its target domain. A DSL should not attempt to solve all problems for
all users for all time. It should not seek to define an entire software
system. It should instead focus on providing an effective, uncluttered
solution to a specific aspect of the overall system.

Which of these boundaries is important in a particular situation depends
upon the style of the DSL.

\hypertarget{external-and-internal-dsl-syles}{%
\subsection{External and Internal DSL
Syles}\label{external-and-internal-dsl-syles}}

\href{http://www.martinfowler.com/bliki/DomainSpecificLanguage.html}{Fowler}
classifies DSLs into two styles {[}Fowler 2008a, 2011{]}:

\begin{itemize}
\item
  external
\item
  internal
\end{itemize}

Although the terminology is relatively new, the ideas are not.

\hypertarget{external}{%
\subsubsection{External}\label{external}}

An \emph{external DSL} is a language that is different from the main
programming language for an application, but that is interpreted by or
translated into a program in the main language. The external DSL is a
\emph{standalone} language with its own syntax and semantics.

The Unix little languages \texttt{pic}, \texttt{lex}, \texttt{yacc}, and
\texttt{make} exhibit this style. They are separate textual languages
with their own syntax and semantics, but they are processed by C
programs (and may also generate C programs).

External DSLs may use ad hoc techniques (e.g.~hand-coded recursive
descent parsers), parser-generation tools (e.g. \texttt{lex} and
\texttt{yacc} in the C/Unix environment, Happy and Alex on the Haskell
platform, \href{http://www.antlr.org/}{ANTLR} on various platforms), or
parsing libraries (e.g.~Haskell library
\href{https://en.wikipedia.org/wiki/Parsec_(parser)}{Parsec}, Scala
\href{http://www.scala-lang.org/api/2.11.8/scala-parser-combinators/\#package}{parser
combinator} library, Python 3 parser combinator library
\href{https://pypi.python.org/pypi/parsita}{Parsita}, Lua
\href{http://www.inf.puc-rio.br/~roberto/lpeg/lpeg.html}{LPeg} library).

Example external DSLs in the course notes include

\begin{itemize}
\item
  most of the Scala-based
  \href{658lectureNotes.html\#StateMachineDSL}{State Machine (Secret
  Panel)} DSLs (adapted from {[}Fowler 2011{]})
\item
  one of the Lua-based \href{658lectureNotes.html\#LairDSL}{Lair
  Configuration} DSLs (adapted from {[}Fowler 2008b{]})
\item
  three of the Ruby-based \href{658lectureNotes.html\#ReaderDSL}{Reader}
  DSLs (adapted from {[}Fowler 2008a{]})
\end{itemize}

To distinguish between external DSLs and GPLs, Fowler cites the need for
a DSL to limit its features to the minimal set needed for the specific
domain {[}Fowler 2011{]}. (See discussion in Boundaries subsection
above.)

For example, Fowler considers the programming language
\href{https://en.wikipedia.org/wiki/R_(programming_language)}{R} a GPL
not a DSL. Although R has special-purpose features to support
statistical programming, it has a full set of general purpose features
for a wide range of programming tasks.

\hypertarget{internal}{%
\subsubsection{Internal}\label{internal}}

An \emph{internal DSL} transforms the main programming language itself
into the DSL--the DSL is \emph{embedded} in the main language {[}Hudak
1998{]}.

The techniques for constructing internal DSLs vary from language to
language.

The language
\href{https://en.wikipedia.org/wiki/Lisp_(programming_language)}{Lisp}
(which was defined in the 1960s) supports \emph{syntactic macros}, a
convenient mechanism for extending the language by adding
application-specific features that are expanded at compile time. The
Lisp macro approach has been refined and included in languages such as
\href{https://en.wikipedia.org/wiki/Scheme_(programming_language)}{Scheme},
\href{https://en.wikipedia.org/wiki/Clojure}{Clojure}, and
\href{https://en.wikipedia.org/wiki/Elixir_(programming_language)}{Elixir}.

Internal DSLs in the language
\href{https://en.wikipedia.org/wiki/Ruby_(programming_language)}{Ruby}
exploit the language's \emph{flexible syntax}, runtime \emph{reflexive
metaprogramming} facilities, and \emph{blocks} (closures). The Ruby on
Rails web framework includes several such internal DSLs.

Haskell's algebraic data type system has stimulated research on
``embedded'' DSLs for several domains including reactive animation and
music {[}Hudak 1998, 2000{]}.

In object-oriented languages, internal DSLs may also exploit object
structures and subtyping.

Example internal DSLs in the instructor's notes include the following:

\begin{itemize}
\item
  Scala-based \href{658lectureNotes.html\#CompConfigDSL}{Computer
  Configuration DSL} (adapted from {[}Fowler 2011{]})
\item
  Scala-based \href{658lectureNotes.html\#EmailMessageDSL}{Email Message
  Building DSL} (adapted from {[}Fowler 2011{]})
\item
  Most of the (Lua- and Python-based)
  \href{658lectureNotes.html\#LairDSL}{Lair Configuration DSLs} (adapted
  from {[}Fowler 2008b{]})
\item
  Ruby-based \href{658lectureNotes.html\#SurveyDSL}{Survey DSL}
  {[}\href{localcopy/surveyLangFinal.pdf}{Cunningham 2008}{]} (partially
  motivated by the sidebar in {[}Bentley 1986{]})
\item
  Haskell- and Scala-based
  \href{658lectureNotes.html\#SandwichDSL}{Sandwich DSL}
\end{itemize}

To distinguish between an internal DSL and a standard
\href{https://en.wikipedia.org/wiki/Command\%E2\%80\%93query_separation}{\emph{command-query}}
\href{https://en.wikipedia.org/wiki/Application_programming_interface}{\emph{Application
Programmer Interface (API)}}, Fowler cites the need for a DSL to be
fluent {[}Fowler 2011{]}. The command-query API provides the vocabulary;
the DSL provides the grammar for composing the vocabulary ``words'' into
``sentences''.

The implementation of a DSL is often supported by an API with a
\href{https://en.wikipedia.org/wiki/Fluent_interface}{\emph{fluent
interface}}, a vocabulary of operations designed to be composed smoothly
into larger operations.

\hypertarget{shallow-and-deep-embeddings-of-internal-dsls}{%
\subsection{Shallow and Deep Embeddings of Internal
DSLs}\label{shallow-and-deep-embeddings-of-internal-dsls}}

The difference between shallow and deep embeddings of an internal DSL
concerns the relationship between the implementations of a DSL's syntax
and its semantics.

\hypertarget{shallow-embedding}{%
\subsubsection{Shallow embedding}\label{shallow-embedding}}

In a \emph{shallow embedding} of an internal DSL, the implementation's
types and data structures directly representa single interpretation of
the semantics of the domain but do not represent the syntactic structure
of the domain objects.

For example, the regular expression package from the Thompson Haskell
textbook {[}Thompson 2011{]}, section 12.3, is a shallow embedding of
the regular expression concept. It models the semantics but not the
syntax of regular expressions. It uses functions to represent the
regular expressions and higher order functions (combinators) to combine
the regular expressions in valid ways.

Similarly, the Scala-based
\href{658lectureNotes.html\#CompConfigDSL}{Computer Configuration} and
\href{658lectureNotes.html\#EmailMessageDSL}{Email Message Building}
DSLs and the Lua-based \href{658lectureNotes.html\#LairDSL}{Lair} DSLs
are relatively shallow embeddings of the DSLs.

The advantage of a shallow embedding is that it provides a simple
implementation of the semantics of the domain. It is usually
straightforward to modify the semantics by adding new operations. If
these capabilities are all that one needs, then a shallow embedding is
convenient.

A disadvantage is that it is sometimes difficult to relate what happens
during execution to the syntactic structure of the program, especially
when errors occur.

\hypertarget{deep-embedding}{%
\subsubsection{Deep embedding}\label{deep-embedding}}

In a \emph{deep embedding} of an internal DSL, the implementation's
types and data structures model both the syntax and semantics of the
domain. That is, it represents the domain objects using \emph{abstract
syntax trees (ASTs)}. These can be interpreted in multiple ways as
needed.

For example, section 19.4 of the Thompson Haskell textbook {[}Thompson
2011{]} redesigns the regular expression package as a deep embedding. It
introduces types that represent the syntactic structure of the regular
expressions as well as their semantics.

The advantage of a deep embedding is that, in addition to manipulating
the semantics of the domain, one can also manipulate the syntactic
representation of the domain objects. The syntactic representation can
be analyzed, transformed, and translated in a many ways that are not
possible with a shallow embedding.

As an example, consider the deep embedding of the regular expression
DSL. It can enable replacement of one regular expression by an
equivalent simpler one, such as replacing \texttt{(a*)*} by \texttt{a*}.

Of course, the disadvantages of deep embedding are that they are more
complex to develop, understand, and modify than shallow embeddings.

\hypertarget{expression-language}{%
\subsubsection{Expression Language}\label{expression-language}}

What about the Expression Language (
\href{http://www.cs.olemiss.edu/~hcc/csci450/notes/ExprLang/10ExprLangSynSem.html}{language
definition},
\href{http://www.cs.olemiss.edu/~hcc/csci450/notes/ExprLang/11ExprLangParsing.html}{parsing},
\href{http://www.cs.olemiss.edu/~hcc/csci450/notes/ExprLang/12ExprLangCompiling.html}{compiling}
) discussed in a separate case study?

The concrete syntax and semantics of the Expression Language is
different from its host language Haskell, so at that level it is an
external DSL. The parsers recognize valid arithmetic expressions in the
input text and create an appropriate abstract syntax tree inside the
Haskell program. The abstract syntax tree differs from the textual
arithmetic expression and from its parse tree.

However, the abstract syntax tree does capture the essential aspects of
the syntax. And the abstract syntax tree itself can be considered a deep
embedding of an internal DSL for the abstract syntax. For the remainder
of the processing of the expression, the abstract syntax tree preserves
the important syntactic (structural) features of the arithmetic
expressions.

The Expression Language case study transformed the abstract syntax trees
by simplifying them (and by generating their symbolic derivatives). The
case study also translated the expressions to instruction sequences for
a Stack Virtual Machine.

The accompanying Sandwich DSL case study (in
\href{SandwichDSL/Haskell/SandwichDSL.html}{Haskell} and
\href{SandwichDSL/Scala/SandwichDSL_Scala.html}{Scala}) gives another
example of how one can create a deeply embedded DSL in a simple
situation.

\hypertarget{dsls-used-to-produce-this-document}{%
\subsection{DSLs Used to Produce This
Document}\label{dsls-used-to-produce-this-document}}

When we look around, we can see DSLs everywhere!

Consider how I create this document on DSLs. I write the text with the
text editor Emacs, which includes a number of extensions, perhaps
written in DSL(s). I indicate the document's structure using a DSL,
\href{https://pandoc.org/}{Pandoc's dialect} {[}MacFarlane 2018{]} of
the markup language
\href{https://en.wikipedia.org/wiki/Markdown}{Markdown}. If I add
mathematical notation to the document, I write these in a subset of the
\href{https://en.wikipedia.org/wiki/LaTeX}{LaTeX} DSL. I then execute
the \href{1\%3Chttp://www.pandoc.org\%3E}{\texttt{pandoc}} tool on the
input file.

Pandoc (which itself is written in Haskell) reads the Markdown input,
converts it to an abstract syntax tree (AST) internally, and then writes
an appropriate HTML (a DSL) output file (that you are likely reading). I
also direct Pandoc to write a LaTeX (a DSL) output file, on which I
execute the tool \texttt{pdflatex} to create a PDF document. I could
generate documents in other standard formats such as Microsoft Word's
\texttt{.docx} format (essentially a DSL) and EPUB (a DSL).

For the HTML output, I direct Pandoc to generate
\href{https://en.wikipedia.org/wiki/MathML}{MathML}, a standard DSL in
the XML family that describes mathematical expressions for display on
the Web. (It is currently supported by the FireFox browser but not all
others, which is why I recommend viewing these documents with FireFox.)

If I wish to include a drawing of a graph structure in the document, I
may express the graph in the \texttt{dot} language supported by the
Graphviz package. Then I can translate it into a Scalable Vector
Graphics (svg) drawing, which is encoded with another dialect of XML.

I wrote the original version of this document directly in HTML before I
started using Pandoc. Other documents used in my courses were written
originally using LaTeX or Word. In some case, I used Pandoc to convert
these documents to Markdown, which gave me the starting point for my
recent changes.

To add a new input format to Pandoc requires a new \emph{reader} program
that can \emph{parse} the input and generate an appropriate abstract
syntax tree.

To add a new output format to Pandoc requires a new \emph{writer}
program that can access the abstract syntax tree and generate
appropriately formatted output.

Pandoc's abstract syntax tree is made available to writers using either
Haskell algebraic data types or
\href{https://en.wikipedia.org/wiki/JSON}{JSON} (JavaScript Object
Notation) structures.

JSON is an external DSL in that it uses a subset of JavaScript's
concrete syntax to express the structure of the data. But, because it is
JavaScript code, it also defines an equivalent internal data structure,
so it also has aspects of an internal DSL. Pandoc could use a JSON
Schema to define the supported format. A JSON Schema is a JSON document
with a specific format (a DSL) that defines the format of other JSON
documents (other DSLs).

The whole conversion process in Pandoc revolves around the abstract
syntax tree. Pandoc enables users to write their own filters that
transform one Pandoc AST to another.

Thus my workflow uses many DSLs directly or indirectly.

\hypertarget{possible-advantages-of-using-dsls}{%
\subsection{Possible Advantages of Using
DSLs}\label{possible-advantages-of-using-dsls}}

Fowler and others give several possible advantages for using DSLs. These
include:

\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\item
  \emph{DSLs can facilitate communication between domain experts and the
  programmers.} {[}Fowler 2011{]}

  A DSL may be a small, simple language that can express important
  aspects of the domain in a manner that nonprogrammers can \emph{read}
  and understand.

  In some cases, users may be able to \emph{write} the DSL programs,
  allowing them to adapt the application to their specific needs without
  the intervention of programmers. But designing a DSL that users can
  write effectively is considerably harder than one they can read
  effectively.

  This is an attack on the effects of the essential complexity of
  software development {[}Brooks 1987{]}.
\item
  \emph{DSLs can help encode domain knowledge in concise, concrete forms
  readable by humans.} {[}Spinellis 2001{]}

  This seeks to mitigate the problem of deciding what to build,
  lessening the effects of the essential complexity of software
  development {[}Brooks 1987{]}. It also enables this knowledge to be
  reused more readily.
\item
  \emph{DSLs and their implementations can increase opportunities for
  software reuse.} {[}Mernik 2005{]}

  An implementation of a DSL may generate code that incorporate key data
  types, software architectures, algorithms, and other domain concepts
  and processes. These are reused from one use of the DSL to another.

  Software reuse is an attack on the effects of the essential complexity
  of software development {[}Brooks 1987{]}.
\item
  \emph{DSLs can improve programmer productivity.} {[}Fowler 2011{]}

  Using a DSL for some narrow, well-defined aspect, programmers can code
  the computation more quickly and reliably with the DSL than directly
  in the GPL.

  For example, many GPLs have sublanguages for specifying regular
  expressions. In most cases, programmers can use these regular
  expression DSLs more productively than programming the pattern
  recognition directly in the GPL. Also, a mature implementation of the
  DSL will likely be more reliable than a new implementation directly in
  the GPL.
\item
  \emph{DSLs can help shift the execution context between compile time
  and runtime.} {[}Fowler 2011{]}

  On the one hand, if some aspect of an application needs to be more
  flexible, we might replace a description in the GPL program
  (e.g.~populating a complex data structure) with a description in a DSL
  that is interpreted at runtime.

  On the other hand, if some aspect of an application needs to be more
  efficient, we might replace a description of the runtime configuration
  code with a description in a DSL that is compiled into efficient GPL
  code.
\item
  \emph{DSLs can enable convenient use of alternative computational
  models that are not natively supported by the GPL used in the overall
  application.} {[}Fowler 2011{]}

  For example, some aspect of the computation might better be expressed
  as a finite state machine, decision table, dependency network,
  rule-based system, etc. We can embed such computations within
  traditional imperative or object-oriented programs by
  designing\textless{} appropriate DSLs.
\item
  \emph{DSLs can promote porting aspects of the application code to a
  different execution platform.} {[}Ward 1994{]}

  The front end phases of the DSL need not be changed, just the back end
  that does code generation or interpretation.

  Note: Fowler argues this is a benefit of having a semantic model, not
  necessarily a DSL.
\end{enumerate}

\hypertarget{possible-disadvantages-of-using-dsls}{%
\subsection{Possible Disadvantages of Using
DSLs}\label{possible-disadvantages-of-using-dsls}}

Fowler also identifies several possible disadvantages of using DSLs.
These include:

\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\item
  \emph{DSLs can contribute to the \textbf{language cacophony}.}
  {[}Fowler 2011{]}

  If many, different, difficult-to-learn languages are used in an
  application or an organization, then this creates considerable work
  for the software developers to learn them all.

  Fortunately, DSLs are usually much easier to learn than GPLs. Also the
  abstractions represented in the DSLs likely will need to be present in
  the libraries, APIs, documentation, and tools regardless. Thus using
  DSLs might not incur as much language learning as it first appears.
\item
  \emph{DSLs can be costly to build and maintain.} {[}Fowler 2011{]}

  A DSL is usually built on top of a library, API, or software
  framework. Designing, implementing, and maintaining the DSL will
  require some additional work.

  Fortunately, DSLs are usually simple and once the developers master
  the language development tools, designing and implementing a DSL can
  be done without a huge investment.

  As with any tool development or acquisition, the software development
  organization must decide if the possible benefits are greater than the
  costs.
\item
  \emph{DSL can contributes to the \textbf{ghetto language} problem.}
  {[}Fowler 2011{]}

  If a language uses many languages that are used nowhere else, then it
  can become difficult to recruit staff.

  This can be a significant problem for use of a GPL. (However, some
  software development organizations choose languages outside the
  mainstream so they can attract the more aggressive staff willing to
  take on interesting technical challenges and use leading edge tools.)

  But DSLs should be small and limited to a narrow domain, so the
  problem should not be as significant as for GPLs. The organization
  should guard against rampant ``mission creep'' for its DSLs.
\item
  \emph{DSLs can sometimes lead to narrow thinking.} {[}Fowler 2011{]}

  The intention of DSL development is to open up the developers to using
  whatever abstractions are appropriate to the domain, rather than those
  that are convenient in the GPL.

  Organizations that use DSLs should avoid falling back into the same
  trap with their DSLs. They should develop appropriate new DSLs when
  needed rather than use an existing DSL with inappropriate
  abstractions.
\end{enumerate}

\hypertarget{designing-dsls}{%
\subsection{Designing DSLs}\label{designing-dsls}}

In a chapter in the functional programming notes, we examine
\emph{families} of related functions to define generic, higher-order
functions to capture the computational patterns for each family. We seek
to raise the level of abstraction in our programs.

Design of DSLs is similar, except that we seek to design a language to
express the family members in some application domain rather than design
a higher-order function.

\hypertarget{scv-analysis}{%
\subsubsection{SCV analysis}\label{scv-analysis}}

We should first analyze the domain systematically and then use the
results to design an appropriate DSL syntax and semantics. We analyze
the domain using \emph{Scope-Commonality-Variability (SCV) analysis}
{[}\href{localcopy/CoplienCommonalityVariability.pdf}{Coplien 1998}{]}
and produce four outputs.

\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\item
  \emph{scope} -- the boundaries of the domain. That is, identify what
  we must address and what we can ignore.
\item
  \emph{terminology} -- the definitions of the specialized terms, or
  concepts, relevant to the domain.
\item
  \emph{commonalities} -- the aspects of the domain that do not change
  from one application to another within the domain. We sometimes call
  these the \emph{frozen spots}.
\item
  \emph{variabilities} -- the aspects of the domain that may change from
  one application to another within the domain. We sometimes call these
  the \emph{hot spots}.
\end{enumerate}

TODO: Expand the explanation (e.g., include model) to include more of
the ideas from {[}Coplien 1998{]} and other sources. Perhaps include
example such as the SurveyDSL design from {[}Cunningham 2008{]}.

In the SCV analysis, we must seek to identify all the implicit
assumptions in the application domain. These implicit assumptions need
to be made explicit in the DSL's design and implementation.

We use the SCV analysis to guide our choices for elements of the DSL
design {[}Mernik 2005, Thibault 1999, Cunningham 2008{]}. The scope
focuses our attention on what we are trying to accomplish. The
terminology and commonalities suggest the DSL statements and constructs.
The commonalities also suggest the semantics of the constructs and the
nature of the underlying computational model. The variabilities
represent syntactic elements to which the DSL programmer can assign
values.

\hypertarget{dsl-design-guidelines}{%
\subsubsection{DSL design guidelines}\label{dsl-design-guidelines}}

TODO: Better integrate the Karsai, Freeman, and other lists of
guidelines.

Karsai et al {[}\href{localcopy/Design_Guidelines_DSLs.pdf}{Karsai
2009}{]} identifies 26 guidelines important for DSL design, grouping
them into 5 categories. The paper focuses on design of external DSLs.
Here we expand their guidelines to include internal DSLs.

\begin{enumerate}
\def\labelenumi{\Alph{enumi}.}
\item
  Language purpose guidelines (What purposes to satisfy)

  \begin{enumerate}
  \def\labelenumii{\arabic{enumii}.}
  \item
    \emph{Identify language uses early.}

    In Fowler's terminology, we must identify the domain and what uses
    the language will have within the domain. We must carefully define
    the scope within the SCV analysis.
  \item
    \emph{Ask questions.}

    What group of users will write the DSL programs? will read them?
    will deploy them for execution? Etc. What are each group's purposes?
    What does each group need to be able to understand and use the DSL
    successfully? Can we simply the DSL further?
  \item
    \emph{Make the language consistent.}

    Avoid surprises. Keep the DSL narrowly focused on its purposes. A
    DSL feature should contribute to the purposes or be omitted. All
    features should be based on a cohesive set of concepts.
  \end{enumerate}
\item
  Language realization guidelines (How to implement)

  \begin{enumerate}
  \def\labelenumii{\arabic{enumii}.}
  \setcounter{enumii}{3}
  \item
    \emph{Decide carefully whether to use a text-based external DSL, a
    graphical external DSL, or an internal DSL hosted in some particular
    language.}

    For the DSL's identified domain, uses, and user groups, what are the
    advantages and disadvantages of each approach? What tools are
    available to support the DSL design, implementation, and use.
  \item
    \emph{Compose existing languages where possible.}

    Can parts of the new DSL's uses be handled by already implemented
    languages and tools? If so, we can avoid the time-consuming and
    error-prone work of designing and implementing a whole new DSL. We
    can combine the existing languages, embed them within a new ``glue''
    language, extend an existing language with a few new features, etc.

    In the ``Little Languages'' paper, Bentley describes how the
    processor for the external DSL \texttt{chem} generates a program in
    the DSL \texttt{pic}. The processor for \texttt{pic} itself
    generates a program in the DSL \texttt{troff}. These ``filter''
    programs are then connected using pipes in the Unix shell (a DSL).
    Furthermore, the implementation of \texttt{pic} specifies the
    lexical analysis and parsing phases using the DSLs \texttt{lex} and
    \texttt{yacc} and defines the overall build process using the DSL
    \texttt{make}.

    Consider an internal DSL such as the Survey DSL. It includes many
    existing features of the host language (Ruby) in the new DSL.
  \item
    \emph{Reuse existing language definitions.}

    Even if the implementation of an existing language cannot be reused,
    we can consider reusing its definition. This saves effort in
    language design and it may leverage the users' knowledge of the
    existing language.

    For example, the Pandoc Markdown dialect embeds LaTeX's widely-known
    mathematical notation to specify mathematical symbols and
    expressions within the text.

    Sometimes building a DSL entails embedding an existing API within a
    fluent interface to implement an internal DSL or devising a similar
    textual notation to form an external DSL {[}Mernik 2005{]}. Or
    perhaps we reenvision a textual or internal DSL as a graphical DSL.
  \item
    \emph{Reuse existing type systems.}

    A language's type system is probably the most difficult to design
    well and implement robustly. A new type system can also be difficult
    for users to learn to use effectively. Thus, by reusing an existing
    type system that the users may know, DSL developers can both make
    their work more efficient and the new DSL easier to understand and
    use.

    An internal DSL selects from and builds on the host language's
    existing type system.
  \end{enumerate}
\item
  Language content guidelines (What features to include)

  \begin{enumerate}
  \def\labelenumii{\arabic{enumii}.}
  \setcounter{enumii}{7}
  \item
    \emph{Reflect only necessary domain concepts.}

    Which artifacts (or objects) from the domain must we capture to
    satisfy the DSL purposes? Which properties of those artifacts? Can
    we leave out the other artifacts and purposes? We should discuss
    possible designs with users and incorporate their feedback.
  \item
    \emph{Keep the DSL simple.}

    Make the DSL easy for users to learn and use effectively. If the
    language is too complex, users may just ignore it. Keep the language
    as simple as possible to ensure effective use.

    Simplicity may be especially difficult to achieve in internal DSLs
    because the boundary between the DSL features and the host language
    may not be clear. Seek to crisply define this boundary.

    The next three guidelines help us achieve simplicity.
  \item
    \emph{Avoid unnecessary generality.}

    This is an aspect of keeping the DSL simple. Design only what is
    necessary to solve the problem. Avoid excessive concern about
    generalizing and parameterizing the language beyond what is needed
    initially. It is difficult to predict what generalizations will
    actually prove useful.

    However, a successful DSL is seldom static. It likely must evolve to
    meet the changes in the domain and the expectations of its users.
    Design the DSL so that it can be extended with new capabilities in
    the future.

    An SCV analysis can reveal important ``variabilities'' (hot spots)
    that either should be incorporated in the initial design or that may
    become important in the future.
  \item
    \emph{Limit the number of language elements.}

    This is a second aspect of keeping the DSL simple. A language with
    many elements is difficult to learn, use, and implement. It is
    better to have a few elements that can be combined flexibly.

    If the domain and purposes are complex, look for ways to break them
    into a set of smaller problems, solve each subproblem by designing a
    sublanguage, and then combine the sublanguages to solve the larger
    problem. Users can focus on the sublanguages they need to carry out
    their specific tasks.

    Alternatively, determine whether some of the more complex elements
    can be moved from the DSL's core and to a ``library'' that can be
    accessed by DSL programs. Enable users to store their own language
    extensions in the library. This approach enables us to extend the
    functionality of the DSL without changing the core language's
    structure.

    For example, languages like Pascal included I/O statements as a
    language construct. Later languages such as C and Java moved I/O to
    a library.
  \item
    \emph{Avoid conceptual redundancy.}

    This is a third aspect of keeping the DSL simple. If there are many
    possible ways to express the same concept in the DSL, users will
    likely become confused and may not use the DSL effectively. Avoid
    unnecessary redundancy.
  \item
    \emph{Avoid inefficient language elements.}

    A DSL raises the level of abstraction and usually obscures the
    details of how a DSL program is actually executed. However, the DSL
    designers and implementers must ensure that the DSL programs execute
    with acceptable efficiency. Moreover, the user of the DSL must be
    able to understand which DSL programs are more efficient than
    others.
  \end{enumerate}
\item
  Concrete syntax guidelines (How to make the DSL readable)

  \begin{enumerate}
  \def\labelenumii{\arabic{enumii}.}
  \setcounter{enumii}{13}
  \item
    \emph{Adapt existing notations domain experts use.}

    Where possible, build on the formal notation that the domain experts
    already know and use. If this needs to be modified or extended, keep
    the changes close to the style of the existing notation. Familiarity
    will make the DSL easier for domain experts to learn and

    Of course, DSL designers may need to formalize the syntax and
    semantics of informal terminology, notation, and processes to enable
    them to be included in an automated DSL. Tools for processing the
    new DSL may also provide capabilities not present in the current
    practice.
  \item
    \emph{Use descriptive notations.}

    Where possible, choose terms and symbols that suggest the intended
    meaning. Avoid using them in ways that differ significantly from the
    way they are used in the domain or in the general public.

    For example, the symbol \texttt{+} usually denotes addition or some
    similar operation; to use it to denote multiplication would likely
    introduce confusion and make the DSL difficult to learn. Similarly,
    using the keyword \texttt{if} to denote something other than a
    conditional would be confusing.
  \item
    \emph{Make elements distinguishable.}

    A DSL will be read by humans more frequently than written. So, when
    the needs of readers and writers conflict, favor the readers over
    the writers.

    Make different elements appear differently in the displayed form.
    Avoid using a subtle difference in notation, spelling, location,
    size, font, or color as the sole way to distinguish between
    different elements. In most cases, make the DSL accessible to those
    with impaired vision.
  \item
    \emph{Use syntactic sugar appropriately.}

    \emph{Syntactic sugar} refers to elements of a language that do not
    add to the expressiveness of the DSL but which may make it easier to
    read and perhaps easier to parse.

    If used in moderation, syntactic sugar makes the language more
    palatable. But overuse may just make the language fat -\/- more
    verbose and confusing.

    This guideline conflicts with the ``avoid conceptual redundancy''
    guideline above. Designers must balance these concerns.
  \item
    \emph{Permit comments.}

    Although comments do not make the DSL more semantically expressive,
    comments enable those writing DSL programs to explain their design
    decisions to others (or their future selves) who need to understand
    and modify the DSL program.

    In addition, comments can allow information to be passed to tools
    that generate structured documentation on the DSL programs.
    (Consider JavaDoc.)
  \item
    \emph{Provide organizational structures for DSL programs}

    To manage a DSL program that grows large and complex, we need to be
    able to break it up into subprograms. The subprograms may need to be
    organized into a graph structure and managed as a group in an
    archive.

    The DSL and the tools that manage DSL programs should allow a group
    of subprograms to be organized into ``packages'' that can be
    selectively included in other DSL programs.

    This solution seeks to manage complexity of DSL programs by breaking
    them into smaller subprograms. The ``library'' solution suggested
    for the ``limit the number of language elements'' guideline seeks to
    manage the complexity of the language itself.
  \item
    \emph{Balance compactness and comprehensibility.}

    Compact notation is generally efficient to write and process.
    However, it may not be comprehensible to the reader.

    Syntactic sugar can make a DSL more comprehensible, but make it more
    difficult to write. And, if overused, it can make the DSL
    confusingly verbose and thus less comprehensible.

    So we must balance among the factors. As noted above, we should
    normally favor the human reader over the writer.
  \item
    \emph{Use the same style everywhere.}

    If a language has several sublanguages, make all the sublanguages
    similar in style. This makes the various languages easier to
    understand and use as a group.

    For example, it would be confusing for one sublanguage to group
    items with \texttt{\{} and \texttt{\}} and another to use
    \texttt{begin} and \texttt{end} for a similar purpose.

    Of course, this guideline must be balanced with the above guidelines
    suggesting we should ``compose existing languages.'' ``reuse
    existing language definitions,'' and ``adapt existing notations
    domain experts use.''
  \item
    \emph{Identify usage conventions.}

    To keep the DSL definition simple, we generally should not rigidly
    enforce minor syntactic issues such as layout. However, good DSL
    programming style can make a program more comprehensible (and more
    pleasant tor read).

    In parallel with the language definition, we should describe good
    style means for layout, naming conventions, order of elements,
    commenting, etc.
  \end{enumerate}
\item
  Abstract syntax guidelines (How to represent the DSL internally)

  \begin{enumerate}
  \def\labelenumii{\arabic{enumii}.}
  \setcounter{enumii}{22}
  \item
    \emph{Align abstract and concrete syntax}

    This means that:

    \begin{itemize}
    \tightlist
    \item
      elements that differ in concrete syntax should differ in internal
      representations
    \item
      elements that are similar in \emph{meaning} should have similar
      internal representations (e.g., be subclasses of same base class)
    \item
      the internal representation of an element should not be dependent
      upon the context in which the element appears
    \end{itemize}

    The goal is to be able to map the human readable representation of
    the DSL to the internal representation. This makes the execution
    easier to understand and debug. It may enable the generation of
    runtime error messages that tie back to the DSL program. (Remember
    the discussion of shallow versus deep embedding of DSLs.)
  \item
    \emph{Prefer layout that does not affect translation from concrete
    to abstract syntax.}

    This guideline suggests that issues like indentation should not
    affect the semantics. Not all language designers (e.g.~of Python and
    Haskell) agree with this guideline.
  \item
    \emph{Enable modularity.}

    To manage the complexity of the language, the ``limit the number of
    language elements'' guideline suggests providing a library of
    non-primitive elements that extend the core language.

    To manage the complexity of large DSL programs, the ``organizational
    structure for DSL programs'' guideline suggests breaking a program
    into a group of related pieces and storing the pieces in a library
    of packages. This can be done at the DSL source code level.

    This ``enable modularity'' guideline suggests the capability to
    build the internal representation of a DSL program by composing
    separately compiled ``modules'' incrementally.

    Perhaps all of these mechanisms are implemented by a single module
    mechanism or there may separate mechanisms for composing precompiled
    modules, DSL source code packages, and DSL extensions.
  \item
    \emph{Introduce interfaces.}

    An \emph{interface} usually defines a set of operation signatures
    (the name, parameters and their types, and return value type) and
    perhaps constraints on the operation's execution (preconditions and
    postconditions).

    If the modules of a large DSL program have well-defined interfaces,
    then the DSL's module mechanism can check whether the modules
    conform with each other's expectations.

    Interfaces and modules together support an information-hiding
    approach to program development. Each module ``hides'' its internal
    details from the other modules in the system.
  \end{enumerate}
\end{enumerate}

\hypertarget{more-guidelines-for-internal-dsl-design}{%
\subsubsection{More guidelines for internal DSL
design}\label{more-guidelines-for-internal-dsl-design}}

TODO: Better integrate the Karsai, Freeman, and other lists of
guidelines.

Drawing on the experience in designing, implementing, and evolving the
JMock internal DSL (which provides mock objects for testing), Freeman
and Pryce make four recommendations for constructing an internal DSL in
Java {[}\href{localcopy/EvolvingEmbeddedDSLJava.pdf}{Freeman 2006}{]}.
To some extent, these recommendations apply to design of all DSLs.

\begin{enumerate}
\def\labelenumi{\arabic{enumi}.}
\setcounter{enumi}{26}
\item
  \emph{Separate syntax and semantics (interpretation) into separate
  layers.}

  The concern of the syntax layer is to provide a fluent, readable
  language for users familiar with the application domain. These may not
  be programmers--or at least not programmers who are experts in the
  host language.

  The concern of the semantic layer is to provide a correct, efficient,
  and maintainable interpreter for the language. The developers and
  maintainers of this layer are typically experts in the host language
  who can take advantage of the implementation language's capabilities
  and idioms.

  As with most nontrivial software development tasks, mixing these
  concerns can make the implementation difficult to develop, understand,
  and maintain. It is better to translate the syntax to to an
  appropriate semantic model (e.g., an abstract syntax tree) for
  processing, hiding the details of each behind well-designed
  interfaces.
\item
  \emph{Use, and perhaps abuse, the host language and its conventions to
  enable the writing of readable DSL programs.}

  For internal DSLs, the syntax layer may need to violate the
  conventional programming styles and naming conventions of the host
  language to achieve the desired readability and fluency.

  For example, DSLs in object-oriented languages may use
  \href{https://en.wikipedia.org/wiki/Method_chaining}{\emph{method
  chaining}} and
  \href{https://en.wikipedia.org/wiki/Method_cascading}{\emph{method
  cascading}} extensively to achieve the desired
  \href{https://en.wikipedia.org/wiki/Fluent_interface}{fluency}. These
  practices usually discouraged in the usual programming practices.
\item
  \emph{Don't trap the user in the internal DSL.}

  Note: This concern of this guideline is similar to discussion of
  libraries, packages, and modules in the Karsai et al guidelines in a
  previous subsection.

  The DSL should encapsulate its internal implementation details to
  avoid unexpected dependence on implementation features that might
  change over time and to enable naive users to use the DSL safely.

  However, it is difficult to anticipate all possible uses of a DSL over
  time. Thus it is helpful to enable expert users of the host language
  to extend the DSL by providing alternative implementations of key
  abstractions.

  The implementation of the DSL should itself be approached as a
  software family. Although the DSL syntax may look different than the
  host language, the DSL implementation should seek to work seamlessly
  with other host language programs.

  We should track the changes that expert users make. These help
  identify possible future enhancements of the DSL and its
  implementation.
\item
  \emph{Map error reports to the syntax layer rather than to the
  semantics layer, which is hidden from the DSL user.}

  Note: The concern of this guideline is similar to that of Karsai et
  al's ``align concrete and abstract syntax'' guideline in a previous
  subsection.

  Good error reports are critical to a successful DSL. As much as
  possible, errors in both syntax and semantics should be stated in
  terms of the syntactic structure of the specific DSL program. This is
  often difficult to accomplish, but it is important because it is
  unlikely that the DSL's users will be familiar with the internal
  details of the implementation.

  Deep embedding of DSLs can make it easier to trace errors back to
  recognizable syntactic structures.
\end{enumerate}

\hypertarget{conclusion}{%
\subsection{Conclusion}\label{conclusion}}

TODO

\hypertarget{exercises}{%
\subsection{Exercises}\label{exercises}}

TODO

\hypertarget{acknowledgements}{%
\subsection{Acknowledgements}\label{acknowledgements}}

In Fall 2016, I adapted and revised much of this work for possible use
in CSci 450 (Organization of Programming Languages), but I did not use
it that semester. These notes are based, in part, on a previous
HTML-source handout on domain-specific languages for exercises in the
Fall 2014 CSci 450 and Spring 2016 CSci 555 classes. The notes draw
ideas from several of the references listed in the final section.

For a Haskell-based offering of CSci 556 (Multiparadigm Programming) in
Spring 2017, I continued to develop these notes, adding discussion of
the boundaries between DSLs and other computing artifacts, of the
Expression Language case study, and of my use of DSLs in the workflow
for producing this document.

I updated these notes slightly in Summer and Fall 2017 for possible use
in CSci 450, but did not use it that semester.

In Spring 2018, I added discussion of Fowler's definition of DSLs,
advantages and disadvantages of DSLs, and DSL design guidelines (from
Karsai et al) and modified the document for use in CSci 658 (Software
Language Engineering).

I maintain these notes as text in Pandoc's dialect of Markdown using
embedded LaTeX markup for the mathematical formulas and then translate
the notes to HTML, PDF, and other forms as needed. The HTML version of
this document may require use of a browser that supports the display of
MathML.

\hypertarget{references}{%
\subsection{References}\label{references}}

\begin{description}
\tightlist
\item[{[}Bentley 1986{]}]
J. Bentley. \href{localcopy/BentleyLittleLangluages.pdf}{Programming
Pearls: Little Languages}, \emph{Communications of the ACM}, Vol. 29,
No.~8, pp.~711-721, August 1986.
{[}\href{localcopy/BentleyLittleLanguages.pdf}{local}{]}
\item[{[}Brooks 1987{]}]
Frederick P. Brooks. \href{localcopy/NoSilverBullet.pdf}{No Silver
Bullet: Essence and Accident in Software Engineering}, \emph{IEEE
Computer}, Vol. 20, No.~4, 10-19, 1987.
\item[{[}Coplien 1998{]}]
J. Coplien, D. Hoffman, and D. Weiss.
\href{https://3aec1b23-a-eadc3f87-s-sites.googlegroups.com/a/gertrudandcope.com/info/Publications/Mpd/IeeeNov1998/coplien.pdf?attachauth=ANoY7cqjLSFhBTUnDP-C_Cj4kLTGu334X-m5cMpw_ErjA8RasBO0e9_FpIRx1o6SKVk29-QYwWl4YeLTOdspGdoohyn8jRDA7pfG8q2gaXJ6EU8NFkbS0Bs5UCiI09kChAAWrTX4-Qv3S-JxGUzHcDvJyaiY4wdqgWeAyoe-BlDYpmSEmiDuSOobndHFoqaZ4VWUvg-N4n01hKVvk5NEQCeSQBkw7lonmWkwUjWwg0363dJ27ScQVYbrdkEHxZvQPdnq2rRwCAMs\&attredirects=0}{Commonality
and Variability in Software Engineering}, \emph{IEEE Software},
15(6):37--45, November 1998.
{[}\href{localcopy/CoplienCommonalityVariability.pdf}{local}{]}
\item[{[}Cunningham 2008{]}]
H. C. Cunningham. \href{localcopy/surveyLangFinal.pdf}{A Little Language
for Surveys: Constructing an Internal DSL in Ruby}, In \emph{Proceedings
of the ACM SouthEast Conference}, 6 pages, March 2008.
\item[{[}Fowler 2008a{]}]
M. Fowler.
\href{https://www.martinfowler.com/bliki/DomainSpecificLanguage.html}{DomainSpecificLanguage},
Blog posting, 15 May 2008 (accessed 16 January 2018).
\item[{[}Fowler 2008b{]}]
Martin Fowler.
\href{http://media.pragprog.com/titles/twa/martin_fowler.pdf}{One Lair
and Twenty Ruby DSLs}, Chapter 3,
\href{http://pragprog.com/book/twa/thoughtworks-anthology}{\emph{The
ThoughtWorks Anthology: Essays on Software Technology and Innovation}},
The Pragmatic Bookshelf, 2008.
\href{LairDSL/FowlerOneLairTwentyDSLs.pdf}{{[}local chapter}{]}
\item[{[}Fowler 2011{]}]
M. Fowler.
\href{https://www.martinfowler.com/books/dsl.html}{\emph{Domain Specific
Languages}}, Addison Wesley, 2011.
\item[{[}Freeman 2006{]}]
S. Freeman and N. Pryce.
\href{https://pdfs.semanticscholar.org/ff98/e181a0d05fc3ebdee3525e41bdf300c34343.pdf}{Evolving
an Embedded Domain-Specific Language in Java}, In \emph{Companion to the
Conference on Object-Oriented Programming Languages, Systems, and
Applications}, pages 855--865. ACM SIGPLAN, October 2006.
{[}\href{localcopy/EvolvingEmbeddedDSLJava.pdf}{local}{]}
\item[{[}Hudak 1996{]}]
Paul Hudak.
\href{https://pdfs.semanticscholar.org/13ca/0d31ee51fbc07113b3cb7e5467aef5b94183.pdf}{Building
Domain-Specific Embedded Languages}, \emph{ACM Computing Surveys}, Vol.
28, No.~4, p.~196, 1996.
{[}\href{localcopy/HudakBuildingDSLs.pdf}{local}{]}
\item[{[}Hudak 1998{]}]
P. Hudak.
\href{https://pdfs.semanticscholar.org/fde8/80da8d091ff1e0d463db96a3919313cf9709.pdf}{Modular
Domain Specific Languages and Tools}, In P. Devanbu and J. Poulin,
editors, \emph{Proceeding of the 5th International Conference on
Software Reuse (ICSR'98)}, pages 134-142. IEEE, 1998.
{[}\href{localcopy/HudakModularDSLTools.pdf}{local}{]}
\item[{[}Karsai 2009{]}]
Gabor Karsai, Holger Krahn, Claas Pinkernell, Bernhard Rumpe, Martin
Schindler and Steven Voelkel.
\href{https://arxiv.org/pdf/1409.2378}{Design Guidelines for Domain
Specific Languages}, In \emph{Proceedings of OOPSLA Workshop on
Domain-Specific Modeling}, 2009. Also arXiv preprint arXiv:1409.2378,
2014. {[}\href{localcopy/Design_Guidelines_DSLs.pdf}{local}{]}
\item[{[}MacFarlane 2018{]}]
J. MacFarlane and the Pandoc community.
\href{http:///www.pandoc.org}{Pandoc: A Universal Document Converter
Wht}, accessed 17 January 2016.
\item[{[}Mernik 2005{]}]
M. Mernik, J. Heering, and A. M. Sloane.
\href{https://pdfs.semanticscholar.org/bd06/01088d5f217dc136a898f577763df92891cb.pdf}{When
and How to Develop Domain Specific Languages}, \emph{ACM Computing
Surveys}, 37(4):316-344, December 2005.
{[}\href{localcopy/WhenDSL.pdf}{local}{]}
\item[{[}Spinellis 2001{]}]
Diomidis Spinellis.
\href{https://www.spinellis.gr/pubs/jrnl/2000-JSS-DSLPatterns/html/dslpat.html}{Notable
Design Patterns for Domain-Specific Languages}, \emph{Journal of Systems
and Software}, Vol. 56, No.~1, pp.~91-99, 2001.
{[}\href{localcopy/Notable_Patterns_DSL.pdf}{local}{]}{]}
\item[{[}Thibault 1999{]}]
S. Thibault, R. Marlet, and C. Consel.
\href{localcopy/DSLDesignImpleVideo.pdf}{Domain-Specific Languages: From
Design to Implementation--Application to Video Device Driver
Generation}. \emph{IEEE Transactions on Software Engineering},
25(3):363--377, May/June 1999.
\item[{[}Thompson 2011{]}]
S. Thompson. \emph{Haskell: The Craft of Functional Programming}, Third
Edition, Addison Wesley, 2011.
\item[{[}van Deursen 2000{]}]
A. van Deursen, P. Klint, and J. Visser.
\href{https://pdfs.semanticscholar.org/8dc3/b9f1f22277556fe0e52b12216bf6e149801a.pdf}{Domain
Specific Languages: An Annotated Bibliography}, \emph{SIGPLAN Notices},
35(6):26-36, June 2000.
{[}\href{localcopy/DSLAnnotatedBib.pdf}{local}{]}
\item[{[}Ward 199{]}]
M. P. Ward.
\href{http://www.cse.dmu.ac.uk/~mward/martin/papers/middle-out-t.pdf}{Language-Oriented
Programming}, \emph{Software--Concepts and Tools,} Vol. 15, No.~4,
pp.~147-161, 1994.
{[}\href{localcopy/LanguageOrientedProgramming.pdf}{local}{]}
\item[{[}Wikipedia{]}]
Wikipedia.
\href{http://en.wikipedia.org/wiki/Domain-specific_language}{Domain-Specific
Language}, accessed 12 April 2017.
\end{description}

\hypertarget{concepts}{%
\subsection{Concepts}\label{concepts}}

TODO: Update this list

Domain-specific languages (DSLs); language nature; fluency; limited
expressiveness; domain focus; DSLs versus general-purpose programming
languages; DSLs versus APIs; external versus internal DSLs; shallow
versus deep embedding of internal DSLs; use of algebraic data types to
implement DSLs

\end{document}