Haskell
Haskell is a general-purpose, statically typed, purely functional programming language with type inference and lazy evaluation. Haskell pioneered several programming language [|features] including type classes for type-safe operator overloading and monadic input/output. It is named after logician Haskell Curry. Haskell's main implementation is the Glasgow Haskell Compiler.
Haskell's semantics are historically based on those of the Miranda programming language, which served to focus the efforts of the initial Haskell working group. The last formal specification of the language was made in July 2010, while the development of GHC continues to expand Haskell via language extensions.
Haskell is used in academia and industry., Haskell was the 28th most popular programming language by Google searches for tutorials, and made up less than 1% of active users on the GitHub source code repository.
History
After the release of Miranda by Research Software Ltd. in 1985, interest in lazy functional languages grew. By 1987, more than a dozen non-strict, purely functional programming languages existed. Miranda was the most widely used, but it was proprietary software. At the conference on Functional Programming Languages and Computer Architecture in Portland, Oregon, there was a strong consensus that a committee be formed to define an open standard for such languages. The committee's purpose was to consolidate existing functional languages into a common one to serve as a basis for future research in functional-language design.Haskell 1.0 to 1.4
Haskell was developed by a committee, attempting to bring together off-the-shelf solutions where possible.Type classes were first proposed by Philip Wadler and Stephen Blott to address the ad hoc handling of equality types and arithmetic overloading in languages at the time.
In early versions of Haskell up until and including version 1.2, user interaction and input/output were handled by both streams-based and continuation-based mechanisms, which were widely considered unsatisfactory. In version 1.3, monadic IO was introduced, along with the generalisation of type classes to higher kinds. Along with "do notation", which provides syntactic sugar for the type class, this gave Haskell an effect system that maintained referential transparency and was convenient.
Another notable change in early versions was the moving of the "sequential evaluation" operation from a type class to a standard function to make refactoring more practical.
The first version of Haskell was defined in 1990. The committee's efforts resulted in a series of language definitions.
File:Base-classes.svg|thumb|right|Hierarchy of type classes in the Haskell Prelude as of GHC 7.10.
The inclusion of and , and of as intermediate between and, are deviations from the Haskell 2010 standard.
Haskell 98
In late 1997, the series culminated in Haskell 98, intended to specify a stable, minimal, portable version of the language and an accompanying standard library for teaching, and as a base for future extensions. The committee expressly welcomed creating extensions and variants of Haskell 98 via adding and incorporating experimental features.In February 1999, the Haskell 98 language standard was originally published as The Haskell 98 Report. In January 2003, a revised version was published as Haskell 98 Language and Libraries: The Revised Report. The language continues to evolve rapidly, with the Glasgow Haskell Compiler implementation representing the current de facto standard.
Haskell 2010
In early 2006, the process of defining a successor to the Haskell 98 standard, informally named Haskell Prime, began. This was intended to be an ongoing incremental process to revise the language definition, producing a new revision up to once per year. The first revision, named Haskell 2010, was announced in November 2009 and published in July 2010.Haskell 2010 added several well-used and uncontroversial features previously enabled via compiler-specific flags.
- Hierarchical module names. Module names are allowed to consist of dot-separated sequences of capitalized identifiers, rather than only one such identifier. This lets modules be named in a hierarchical manner, although technically modules are still in a single monolithic namespace. This extension was specified in an addendum to Haskell 98 and was in practice universally used.
- The foreign function interface allows bindings to other programming languages. Only bindings to C are specified in the Report, but the design allows for other language bindings. To support this, data type declarations were permitted to contain no constructors, enabling robust nonce types for foreign data that could not be constructed in Haskell. This extension was also previously specified in an Addendum to the Haskell 98 Report and widely used.
- So-called n+''k'' patterns = were no longer allowed. This syntactic sugar had misleading semantics, in which the code looked like it used the
operator, but in fact desugared to code usingand. - The rules of type inference were relaxed to allow more programs to type check.
- Some syntax issues were fixed: pattern guards were added, allowing pattern matching within guards; resolution of operator fixity was specified in a simpler way that reflected actual practice; an edge case in the interaction of the language's lexical syntax of operators and comments was addressed, and the interaction of do-notation and if-then-else was tweaked to eliminate unexpected syntax errors.
- The
LANGUAGEpragma was specified. By 2010, dozens of extensions to the language were in wide use, and GHC provided theLANGUAGEpragma to specify individual extensions with a list of identifiers. Haskell 2010 compilers are required to support theHaskell2010extension and are encouraged to support several others, which correspond to extensions added in Haskell 2010.Future standards
Features
Haskell features lazy evaluation, lambda expressions, pattern matching, list comprehension, type classes and type polymorphism. It is a purely functional programming language, which means that functions generally have no side effects. A distinct construct exists to represent side effects, orthogonal to the type of functions. A pure function can return a side effect that is subsequently executed, modeling the impure functions of other languages.Haskell has a strong, static type system based on Hindley–Milner type inference. Its principal innovation in this area is type classes, originally conceived as a principled way to add overloading to the language, but since finding many more uses.
The construct that represents side effects is an example of a monad: a general framework which can model various computations such as error handling, nondeterminism, parsing and software transactional memory. They are defined as ordinary datatypes, but Haskell provides some syntactic sugar for their use.
Haskell has an open, published specification, and [|multiple implementations exist]. Its main implementation, the Glasgow Haskell Compiler, is both an interpreter and native-code compiler that runs on most platforms. GHC is noted for its rich type system incorporating recent innovations such as generalized algebraic data types and type families. The Computer Language Benchmarks Game also highlights its high-performance implementation of concurrency and parallelism.
An active, growing community exists around the language, and more than 5,400 third-party open-source libraries and tools are available in the online package repository Hackage.
Code examples
A "Hello, World!" program in Haskell :module Main where -- not needed in interpreter, is the default in a module file
main :: IO -- the compiler can infer this type definition
main = putStrLn "Hello, World!"
The factorial function in Haskell, defined in a few different ways :
factorial :: => a -> a
-- Using recursion
factorial n = if n < 2
then 1
else n * factorial
-- Using recursion
factorial 0 = 1
factorial n = n * factorial
-- Using recursion
factorial n
| n < 2 = 1
| otherwise = n * factorial
-- Using a list and the "product" function
factorial n = product
-- Using fold
factorial n = foldl 1
-- Point-free style
factorial = foldr 1. enumFromTo 1
Using Haskell's Fixed-point combinator allows this function to be written without any explicit recursion.
import Data.Function
factorial = fix fac
where fac f x
| x < 2 = 1
| otherwise = x * f
As the Integer type has arbitrary-precision, this code will compute values such as
factorial 100000, with no loss of precision.An implementation of an algorithm similar to quick sort over lists, where the first element is taken as the pivot:
-- Type annotation
quickSort :: Ord a => ->
-- Using list comprehensions
quickSort = -- The empty list is already sorted
quickSort = quickSort -- Sort the left part of the list
++ ++ -- Insert pivot between two sorted parts
quickSort -- Sort the right part of the list
-- Using filter
quickSort =
quickSort = quickSort
++ ++
quickSort
Implementations
All listed implementations are distributed under open source licenses.Implementations that fully or nearly comply with the Haskell 98 standard include:
- The Glasgow Haskell Compiler compiles to native code on many different processor architectures, and to ANSI C, via one of two intermediate languages: C--, or in more recent versions, LLVM bitcode. GHC has become the de facto standard Haskell dialect. There are libraries that work only with GHC. GHC was also distributed with the Haskell platform. GHC features an asynchronous runtime that also schedules threads across multiple CPU cores similar to the Go runtime.
- Jhc, a Haskell compiler written by John Meacham, emphasizes speed and efficiency of generated programs and exploring new program transformations.
- * Ajhc is a fork of Jhc.
- The Utrecht Haskell Compiler is a Haskell implementation from Utrecht University. It supports almost all Haskell 98 features plus many experimental extensions. It is implemented using attribute grammars and is primarily used for research on generated type systems and language extensions.
- The Haskell User's Gofer System is a bytecode interpreter. It was once one of the implementations used most widely, alongside the GHC compiler, but has now been mostly replaced by GHCi. It also comes with a graphics library.
- HBC is an early implementation supporting Haskell 1.4. It was implemented by Lennart Augustsson in, and based on, Lazy ML. It has not been actively developed for some time.
- nhc98 is a bytecode compiler focusing on minimizing memory use.
- * The York Haskell Compiler was a fork of nhc98, with the goals of being simpler, more portable and efficient, and integrating support for Hat, the Haskell tracer. It also had a JavaScript backend, allowing users to run Haskell programs in web browsers.
- Eta and Frege are dialects of Haskell targeting the Java virtual machine.
- Gofer is an educational dialect of Haskell, with a feature called constructor classes, developed by Mark Jones. It is supplanted by Haskell User's Gofer System.
- Helium, a newer dialect of Haskell. The focus is on making learning easier via clearer error messages by disabling type classes as a default.