Improve documentation.
Change-Id: Iab20bde3fdf1fee5c6b47c748baae0266769e333
Reviewed-on: https://swiftshader-review.googlesource.com/5782
Reviewed-by: Nicolas Capens <capn@google.com>
Tested-by: Nicolas Capens <capn@google.com>
diff --git a/CONTRIBUTING.txt b/CONTRIBUTING.txt
index b717d67..2cedc8e 100644
--- a/CONTRIBUTING.txt
+++ b/CONTRIBUTING.txt
@@ -19,6 +19,8 @@
### Code reviews
All submissions, including submissions by project members, require review.
+Information on how to submit changes for review is provided in README.md.
+
### The small print
Contributions made by corporations are covered by a different agreement than
the one above, the
diff --git a/README.md b/README.md
index fbb4fbf..1bf7e65 100644
--- a/README.md
+++ b/README.md
@@ -16,20 +16,23 @@
Contributing
-----------------
-See CONTRIBUTING.txt for important contributing requirements.
+See [CONTRIBUTING.txt](CONTRIBUTING.txt) for important contributing requirements.
-The canonical repository for SwiftShader is hosted at
+The canonical repository for SwiftShader is hosted at:
https://swiftshader.googlesource.com/SwiftShader
-All changes must be reviewed and approved in the Gerrit review tool at
+All changes must be reviewed and approved in the [Gerrit](https://www.gerritcodereview.com/) review tool at:
https://swiftshader-review.googlesource.com
-All changes require a Change-ID tag in the commit message. A commit hook may be used to add this tag automatically, and can be found at:
+Authenticate your account here:
+https://swiftshader-review.googlesource.com/new-password
+
+All changes require a [Change-ID](https://gerrit-review.googlesource.com/Documentation/user-changeid.html) tag in the commit message. A commit hook may be used to add this tag automatically, and can be found at:
https://gerrit-review.googlesource.com/tools/hooks/commit-msg. To clone the repository and install the commit hook in one go:
git clone https://swiftshader.googlesource.com/SwiftShader && (cd SwiftShader && curl -Lo `git rev-parse --git-dir`/hooks/commit-msg https://gerrit-review.googlesource.com/tools/hooks/commit-msg ; chmod +x `git rev-parse --git-dir`/hooks/commit-msg)
-Changes are uploaded to Gerrit by performing
+Changes are uploaded to Gerrit by performing:
git push origin HEAD:refs/for/master
@@ -43,21 +46,21 @@
Public mailing list: swiftshader@googlegroups.com
-Bug tracker: bugs.chromium.org/p/swiftshader
+Bug tracker: https://bugs.chromium.org/p/swiftshader
License
----------
-The SwiftShader project is licensed under the Apache License Version 2.0. You can find a copy of it in LICENSE.txt.
+The SwiftShader project is licensed under the Apache License Version 2.0. You can find a copy of it in [LICENSE.txt](LICENSE.txt).
Files in the third_party folder are subject to their respective license.
Authors and Contributors
-----------------------------------
-The legal authors for copyright purposes are listed in AUTHORS.txt.
+The legal authors for copyright purposes are listed in [AUTHORS.txt](AUTHORS.txt).
-CONTRIBUTORS.txt contains a list of names of individuals who have contributed to SwiftShader. If you're not on the list, but you've signed the Google CLA and have contributed more than a formatting change, feel free to request to be added.
+[CONTRIBUTORS.txt](CONTRIBUTORS.txt) contains a list of names of individuals who have contributed to SwiftShader. If you're not on the list, but you've signed the [Google CLA](https://cla.developers.google.com/clas) and have contributed more than a formatting change, feel free to request to be added.
Disclaimer
---------------
diff --git a/docs/Reactor.md b/docs/Reactor.md
index 263c1bf..2a2b2dd 100644
--- a/docs/Reactor.md
+++ b/docs/Reactor.md
@@ -32,13 +32,13 @@
Specialization in general is the use of a more optimal routine that is specific for a certain set of conditions. For example when sorting two numbers it is faster to swap them if they are not yet in order, than to call a generic quicksort function. Specialization can be done statically, by explicitly writing each variant or by using metaprogramming to generate multiple variants at static compile time, or dynamically by examining the parameters at run-time and generating a specialized path.
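
As a concrete illustration of the sorting example above (a minimal sketch added for illustration, not part of the documentation being changed; the function names are hypothetical), the specialized path knows the element count statically and reduces to a single compare-and-swap, while the generic path pays for a general-purpose sort:

```
#include <algorithm>
#include <cassert>
#include <utility>

// Generic path: sort an arbitrary range, even when it happens to hold
// only two elements, paying the cost of a general-purpose sort.
void sortGeneric(int *data, int count)
{
    std::sort(data, data + count);
}

// Specialized path: the element count is known to be exactly two,
// so a single compare-and-swap suffices.
void sortTwo(int &a, int &b)
{
    if (a > b) std::swap(a, b);
}

int main()
{
    int values[] = {5, 3};
    sortGeneric(values, 2);
    assert(values[0] == 3 && values[1] == 5);

    int x = 5, y = 3;
    sortTwo(x, y);
    assert(x == 3 && y == 5);
}
```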
-Because specialization can be done statically, sometimes aided by metaprogramming, the ability of a JIT-compiler to do it at run-time is often disregarded. Specialized benchmarks show no advantage of JIT code over static code. However, having a specialized benchmark does not take into account that a typical real-world application deals with many unpredictable conditions. Systems can have one core or several dozen cores, and many different ISA extensions. This alone can make it impractical to write fully specialized routines manually, and with the help of metaprogramming it results in code bloat. Worse yet, any non-trivial application has a layered architecture in which lower layers (e.g. framework APIs) know very little or nothing about the usage by higher layers. Various parameters also depend on user input. Run-time specialization can have access to the full context in which each routine executes, and although the optimization contribution of specialization for a single parameter is small, the combined speedup can be huge. As an extreme example, interpreters can execute any kind of program in any language, but by specializing for a specific program you get a compiled version of that program. But you dont need a full-blown language to observe a huge difference between interpretation and specialization through compilation. Most applications process some form of list of commands in an interpreted fashion, and even the series of calls into a framework API can be compiled into a more efficient whole at run-time.
+Because specialization can be done statically, sometimes aided by metaprogramming, the ability of a JIT-compiler to do it at run-time is often disregarded. Specialized benchmarks show no advantage of JIT code over static code. However, having a specialized benchmark does not take into account that a typical real-world application deals with many unpredictable conditions. Systems can have one core or several dozen cores, and many different ISA extensions. This alone can make it impractical to write fully specialized routines manually, and with the help of metaprogramming it results in code bloat. Worse yet, any non-trivial application has a layered architecture in which lower layers (e.g. framework APIs) know very little or nothing about the usage by higher layers. Various parameters also depend on user input. Run-time specialization can have access to the full context in which each routine executes, and although the optimization contribution of specialization for a single parameter is small, the combined speedup can be huge. As an extreme example, interpreters can execute any kind of program in any language, but by specializing for a specific program you get a compiled version of that program. But you don't need a full-blown language to observe a huge difference between interpretation and specialization through compilation. Most applications process some form of list of commands in an interpreted fashion, and even the series of calls into a framework API can be compiled into a more efficient whole at run-time.
-While the benefit of run-time specialization should now be apparent, JIT-compiled languages lack many of the practical advantages of static compilation. JIT-compilers are very constrained in how much time they can spend on compiling the bytecode into machine code. This limits their ability to even reach parity with static compilation, let alone attempt to exceed it by performing run-time specialization. Also, even if the compilation time was not as constrained, they cant specialize at every opportunity because it would result in an explosive growth of the amount of generated code. Theres a need to be very selective in only specializing the hotspots for often recurring conditions, and to manage a cache of the different variants. Even just selecting the size of the set of variables that form the entire condition to specialize for can get immensely complicated.
+While the benefit of run-time specialization should now be apparent, JIT-compiled languages lack many of the practical advantages of static compilation. JIT-compilers are very constrained in how much time they can spend on compiling the bytecode into machine code. This limits their ability to even reach parity with static compilation, let alone attempt to exceed it by performing run-time specialization. Also, even if the compilation time was not as constrained, they can't specialize at every opportunity because it would result in an explosive growth of the amount of generated code. There's a need to be very selective in only specializing the hotspots for often recurring conditions, and to manage a cache of the different variants. Even just selecting the size of the set of variables that form the entire condition to specialize for can get immensely complicated.
-Clearly we need a manageable way to benefit from run-time specialization where it would help significantly, while still resorting to static compilation for anything else. A crucial observation is that the developer has expectations about the applications behavior, which is valuable information which can be exploited to choose between static or JIT-compilation. One way to do that is to use an API which JIT-compiles the commands provided by the application developer. An example of this is an advanced DBMS which compiles the query into an optimized sequence of routines, each specialized to the data types involved, the sizes of the CPU caches, etc. Another example is a modern graphics API, which takes shaders (a routine executed per pixel or other element) and a set of parameters which affect their execution, and compiles them into GPU-specific code. However, these examples have a very hard divide between what goes on inside the API and outside. You cant exchange data between the statically compiled outside world and the JIT-compiled routines, unless through the API, and they have very different execution models. In other words they are highly domain specific and not generic ways to exploit run-time specialization in arbitrary code.
+Clearly we need a manageable way to benefit from run-time specialization where it would help significantly, while still resorting to static compilation for anything else. A crucial observation is that the developer has expectations about the application's behavior, which is valuable information which can be exploited to choose between static or JIT-compilation. One way to do that is to use an API which JIT-compiles the commands provided by the application developer. An example of this is an advanced DBMS which compiles the query into an optimized sequence of routines, each specialized to the data types involved, the sizes of the CPU caches, etc. Another example is a modern graphics API, which takes shaders (a routine executed per pixel or other element) and a set of parameters which affect their execution, and compiles them into GPU-specific code. However, these examples have a very hard divide between what goes on inside the API and outside. You can't exchange data between the statically compiled outside world and the JIT-compiled routines, unless through the API, and they have very different execution models. In other words they are highly domain specific and not generic ways to exploit run-time specialization in arbitrary code.
-This is becoming especially problematic for GPUs, as they are now just as programmable as CPUs but you can still only command them through an API. Attempts to disguise this by using a single language, such as C++AMP and SYCL, still have difficulties expressing how data is exchanged, dont actually provide control over the specialization, they have hidden overhead, and they have unpredictable performance characteristics across devices. Meanwhile CPUs gain ever more cores and wider SIMD vector units, but statically compiled languages dont readily exploit this and cant deal with the many code paths required to extract optimal performance. A different language and framework is required.
+This is becoming especially problematic for GPUs, as they are now just as programmable as CPUs but you can still only command them through an API. Attempts to disguise this by using a single language, such as C++AMP and SYCL, still have difficulties expressing how data is exchanged, don't actually provide control over the specialization, they have hidden overhead, and they have unpredictable performance characteristics across devices. Meanwhile CPUs gain ever more cores and wider SIMD vector units, but statically compiled languages don't readily exploit this and can't deal with the many code paths required to extract optimal performance. A different language and framework is required.
Concepts and Syntax
-------------------
@@ -73,7 +73,7 @@
assert(result == 1);
```
-Note that ```Function<>``` objects are relatively heavyweight, since they have the entire JIT-compiler behind them, while ```Routine``` objects are lightweight and merely provide storage and lifetime management of generated routines. So we typically allow the ```Function<>``` object to be destroyed (by going out of scope), while the ```Routine``` object is retained until we no longer need to call the routine. Hence the distinction between then and the need for a couple of lines of boilerplate code.
+Note that ```Function<>``` objects are relatively heavyweight, since they have the entire JIT-compiler behind them, while ```Routine``` objects are lightweight and merely provide storage and lifetime management of generated routines. So we typically allow the ```Function<>``` object to be destroyed (by going out of scope), while the ```Routine``` object is retained until we no longer need to call the routine. Hence the distinction between them and the need for a couple of lines of boilerplate code.
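
To make this lifetime pattern concrete, here is a minimal sketch (added for illustration only), reusing the API from the example above; the exact template signature, the `L"one"` routine name, and the `getEntry()` cast are assumptions based on that example and may differ in other Reactor versions:

```
// Keep only the lightweight Routine; let the heavyweight Function<> go away.
Routine *routine = nullptr;

{
    Function<Int(Void)> function;   // carries the entire JIT-compiler
    {
        Return(Int(1));             // trivial routine body, as in the example above
    }

    routine = function(L"one");     // generate the machine code
}                                   // Function<> destroyed here

int (*callable)() = (int(*)())routine->getEntry();
assert(callable() == 1);            // the generated routine outlives the Function<>
```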
### Arguments and Expressions