live-bootstrap/parts.rst

1093 lines
37 KiB
ReStructuredText
Raw Normal View History

.. sectnum:: :start: 0
.. SPDX-FileCopyrightText: 2022 Dor Askayo <dor.askayo@gmail.com>
.. SPDX-FileCopyrightText: 2021 Andrius Štikonas <andrius@stikonas.eu>
.. SPDX-FileCopyrightText: 2021 Paul Dersey <pdersey@gmail.com>
2023-02-11 00:41:05 +00:00
.. SPDX-FileCopyrightText: 2021-23 fosslinux <fosslinux@aussies.space>
2021-05-18 12:37:21 +01:00
.. SPDX-FileCopyrightText: 2021 Melg Eight <public.melg8@gmail.com>
.. SPDX-License-Identifier: CC-BY-SA-4.0
bootstrap-seeds
===============
This is where it all begins. We start with the two raw binary seeds ``hex0-seed`` and ``kaem-optional-seed``.
First, we use those seeds to rebuild themselves.
Note that all early steps before ``mes`` are part of `stage0-posix <https://github.com/oriansj/stage0-posix>`_.
hex0
====
``hex0`` is fairly trivial to implement and for each pair of hexadecimals characters it outputs a byte. We have also added two types of line comments (``#`` and ``;``) to create a well commented lines like:
.. code:: scheme
# :loop_options [_start + 0x6F]
39D3 ; cmp_ebx,edx # Check if we are done
74 14 ; je !loop_options_done # We are done
83EB 02 ; sub_ebx, !2 # --options
In the first steps we use initial ``hex0`` binary seed to rebuild ``kaem-optional`` and ``hex0`` from their source.
``hex0`` code is somewhat tedious to read and write as it is basically a well documented machine code. We have to manually calculate all jumps in the code.
``hex0`` can be approximated with: ``sed 's/[;#].*$//g' $input_file | xxd -r -p > $output_file``
kaem-optional
=============
``kaem-optional`` is a trivial shell that can read list of commands together with their command line arguments from a file and executes them. It also supports line comments but has no other features.
hex1
====
This is the last program that has to be written in ``hex0`` language. ``hex1`` is a simple extension of ``hex0`` and adds a single character labels and allows calculating 32-bit offsets from current position in the code to the label. ``hex1`` code might look like:
.. code:: scheme
:a #:loop_options
39D3 ; cmp_ebx,edx # Check if we are done
0F84 %b ; je %loop_options_done # We are done
83EB 02 ; sub_rbx, !2 # --options
hex2
====
``hex2`` is our final hex language that adds support for labels of arbitrary length. It also allows accessing them via 8, 16, 32-bit relative addresses (``!``, ``@``, ``%``) and via 16-bit or 32-bit (``$``, ``&``) absolute addresses:
.. code:: scheme
:loop_options
39D3 ; cmp_ebx,edx # Check if we are done
74 !loop_options_done ; je8 !loop_options_done # We are done
83EB 02 ; sub_ebx, !2 # --options
catm
====
``catm`` allows concatenating files with ``catm output_file input1 input2 ... inputN``. This allows us to distribute common code in separate files. We will first use it to append ELF header to ``.hex2`` files. Before this step the ELF header had to be included in the source file itself.
M0
==
The ``M0`` assembly language is the simplest assembly language you can create that enables the creation of more complicated programs. It includes only a single keyword: ``DEFINE`` and leverages the language properties of ``hex2`` along with extending the behavior to populate immediate values of various sizes and formats.
Thus ``M0`` code looks like:
.. code:: bash
DEFINE cmp_ebx,edx 39D3
DEFINE je 0F84
DEFINE sub_ebx, 81EB
:loop_options
cmp_ebx,edx # Check if we are done
je %loop_options_done # We are done
sub_ebx, %2 # --options
cc_x86
======
The ``cc_x86`` implements a subset of the C language designed in ``M0`` assembly. It is a somewhat limited subset of C but complete enough to make it easy to write a more usable C compiler written in the C subset that ``cc_x86`` supports.
2023-01-21 01:31:09 +00:00
At this stage we start using `M2libc <https://github.com/oriansj/M2libc/>`_ as our C library. In fact, ``M2libc`` ships two versions of C library. There is a single-file library that contains just enough to build ``M2-Planet`` and there is a full version that is rather well-featured.
M2-Planet
=========
This is the only C program that we build with ``cc_x86``. `M2-Planet <https://github.com/oriansj/M2-Planet>`_ supports a larger subset of C than ``cc_x86`` and we are somewhat closer to C89 (it does not implement all C89 features but on the other hand it does have some C99 features). ``M2-Planet`` also includes a very basic preprocessor, so we can use stuff like ``#define``, ``#ifdef``.
``M2-Planet`` is also capable of using full ``M2libc`` C library that has more features and optimizations compared to bootstrap version of ``M2libc``.
``M2-Planet`` supports generating code for various architectures including ``x86``, ``amd64``, ``armv7``, ``aarch64``, ``riscv32`` and ``riscv64``. Up until this point bootstrap has been very architecture specific. From now on we still have platform specific bits of code but they are usually handled as conditionals in the same application rather than having completely different applications.
mescc-tools
===========
Now we build ``blood-elf`` used to generate debug info, C version of ``hex2`` (also called ``hex2``) and C version of ``M0`` called ``M1``. These are more capable than their platform specific hex counterparts and are fully cross-platform. Thus we can now have the whole toolchain written in C.
Then we rebuild ``mescc-tools`` again, so all our tools are using new toolchain written in C.
Finally, we build `kaem` which is a more capable version of `kaem-optional` and adds support for variables, environmental variables, conditionals and aliases. It also has various built-ins such as `cd` and `echo`.
M2-Mesoplanet
=============
``M2-Mesoplanet`` is a preprocessor that is more capable than ``M2-Planet`` and supports ``#include`` statements. It can also launch compiler, assembler and linker with the correct arguments, so we don't need to invoke them
manually.
At the moment it is only used to build ``mescc-tools-extra``.
M2-Planet
=========
We rebuild ``M2-Planet`` with ``M2-Planet``.
From here, we can move on from the lowest level stuff.
mescc-tools-extra
=================
``mescc-tools-extra`` contains some additional programs, namely filesystem
utilities ``cp`` and ``chown``. This allows us to have one unified
2021-06-10 18:01:47 +01:00
directory for our binaries. Futhermore, we also build ``sha256sum``, a
checksumming tool, that we use to ensure reproducibility and authenticity
of generated binaries. We also build initial ``untar``, ``ungz`` and ``unbz2``
2021-06-10 18:01:47 +01:00
utilities to deal with compressed archives.
2022-04-07 00:09:45 +01:00
``/sysa``
2022-05-04 19:32:45 +01:00
=========
2022-04-07 00:09:45 +01:00
We now move into the ``/sysa`` directory. As stage0-posix has no
2021-06-10 18:01:47 +01:00
concept of ``chdir()`` (not added until very late in stage0-posix),
we have to copy a lot of files into the root of the initramfs, making it
2022-04-07 00:09:45 +01:00
very messy. We get into the move ordered directory ``/sysa`` here,
copying over all of the required binaries from ``/``.
2022-05-04 19:32:45 +01:00
mes 0.24
========
GNU ``mes`` is a scheme interpreter. It runs the sister project ``mescc``,
which is a C compiler written in scheme, which links against the Mes C
Library. All 3 are included in this same repository. There are two stages
to this part:
1. Compiling an initial mes using ``M2-Planet``. Note that this is
*only* the Mes interpreter, not the libc or anything else.
2022-05-04 19:32:45 +01:00
2. We then use this to recompile the Mes interpreter as well as building
the libc. This second interpreter is faster and less buggy.
tinycc 0.9.26
=============
``tinycc`` is a minimal C compiler that aims to be small and fast. It
complies with all C89 and most of C99 standards.
First, we compile jannekes fork of tcc 0.9.26 using ``mescc``,
containing 27 patches to make it operate well in the bootstrap
environment and make it compilable using ``mescc``. This is a
non-trivial process and as seen within tcc.kaem has many different parts
within it: a. tcc 0.9.26 is first compiled using ``mescc``. b. The mes
libc is recompiled using tcc (``mescc`` has a non-standard ``.a``
format), including some additions for later programs. c. tcc 0.9.26 is
recompiled 5(!) times to add new features that are required for other
features, namely ``long long`` and ``float``. Each time, the libc is
also recompiled.
tinycc 0.9.27
=============
Now, we compile upstream tcc 0.9.27, the latest release of tinycc, using
the final version of tcc 0.9.26.
From this point onwards, until further notice, all programs are compiled
using tinycc 0.9.27.
Note that now we begin to delve into the realm of old GNU software,
using older versions compilable by tinycc. Prior to this point, all tools
have been adapted significantly for the bootstrap; now, we will be using
old tooling instead.
make 3.82
=========
GNU ``make`` is now built so we have a more robust building system.
``make`` allows us to do things like define rules for files rather than
writing complex kaem scripts.
2022-05-14 19:40:52 +01:00
gzip 1.2.4
==========
``gzip`` is the most common compression format used for software source
code. It is more capble than ``ungz`` from ``stage0-posix`` and also supports
compression.
2022-05-14 12:42:03 +01:00
tar 1.12
========
We build GNU Tar 1.12, the last version compilable with mes libc.
sed 4.0.9
=========
You are most likely aware of GNU ``sed``, a line editor.
patch 2.5.9
===========
``patch`` is a very useful tool at this stage, allowing us to make
significantly more complex edits, including just changes to lines.
bzip2 1.0.8
===========
``bzip2`` is a compression format that compresses more than ``gzip``. It
is preferred where we can use it, and makes source code sizes smaller.
2021-03-17 00:24:37 +00:00
coreutils 5.0
2021-03-17 22:17:11 +00:00
=============
GNU Coreutils is a collection of widely used utilities such as ``cat``,
``chmod``, ``chown``, ``cp``, ``install``, ``ln``, ``ls``, ``mkdir``,
``mknod``, ``mv``, ``rm``, ``rmdir``, ``tee``, ``test``, ``true``, and
many others.
A few of the utilities cannot be easily compiled with Mes C library, so
we skip them.
The ``cp`` in this stage replaces the ``mescc-tools-extra`` ``cp``.
heirloom devtools
=================
``lex`` and ``yacc`` from the Heirloom project. The Heirloom project is
a collection of standard UNIX utilities derived from code by Caldera and
Sun. Differently from the analogous utilities from the GNU project, they
can be compiled with a simple ``Makefile``.
bash 2.05b
==========
GNU ``bash`` is the most well known shell and the most complex piece of
software so far. However, it comes with a number of great benefits over
kaem, including proper POSIX sh support, globbing, etc.
Bash ships with a bison pre-generated file here which we delete.
Unfortunately, we have not bootstrapped bison but fortunately for us,
heirloom yacc is able to cope here.
flex 2.5.11
===========
``flex`` is a tool for generating lexers or scanners: programs that
recognize lexical patters.
Unfortunately ``flex`` also depends on itself for compiling its own
scanner, so first flex 2.5.11 is compiled, with its scanner definition
2022-05-04 19:32:45 +01:00
manually modified so that it can be processed by lex from the Heirloom
project (the required modifications are mostly syntactical, plus a few
workarounds to avoid some flex advanced features).
tcc 0.9.27 (patched)
2022-12-25 10:18:12 +00:00
====================
2022-12-25 10:30:15 +00:00
We recompile ``tcc`` with some patches needed to build musl.
musl 1.1.24
===========
``musl`` is a C standard library that is lightweight, fast, simple,
free, and strives to be correct in the sense of standards-conformance
and safety. ``musl`` is used by some distributions of GNU/Linux as their
C library. Our previous Mes C library was incomplete which prevented us
from building many newer or more complex programs.
``tcc`` has slight problems when building and linking ``musl``, so we
apply a few patches. In particular, we replace all weak symbols with
strong symbols and will patch ``tcc`` in the next step to ignore
duplicate symbols.
tcc 0.9.27 (musl)
=================
We recompile ``tcc`` against musl. This is a two stage process. First we
build tcc-0.9.27 using tcc-0.9.26 that itself links to Mes C library but produces
binaries linked to musl. Then we recompile newly produced tcc with
itself. Interestingly, tcc-0.9.27 linked against musl is self hosting.
musl 1.1.24 (tcc-musl)
======================
We now rebuild ``musl`` with the just built ``tcc-musl``, which fixes a
number of bugs, particularly regarding floats, in the first ``musl``.
tcc 0.9.27 (musl v2)
====================
Now that we have a fixed ``musl``, we now recompile ``tcc`` as ``tcc``
uses floats extensively.
2021-03-11 18:12:52 +00:00
sed 4.0.9
=========
``sed`` is rebuilt against musl.
2021-03-11 18:12:52 +00:00
bzip2 1.0.8
===========
``bzip2`` is rebuilt unpatched with the new tcc and musl fixing issues
with reading files from stdin that existed in the previous build.
m4 1.4.7
========
``m4`` is the first piece of software we need in the autotools suite,
flex 2.6.4 and bison. It allows macros to be defined and files to be
generated from those macros.
2022-05-04 01:09:24 +01:00
flex 2.6.4
==========
We recompile unpatched GNU ``flex`` using older flex 2.5.11. This is
again a two stage process, first compiling flex using ``scan.c`` (from
``scan.l``) created by old flex, then recompile ``scan.c`` using the new
version of flex to remove any buggy artifacts from the old flex.
bison 3.4.1
===========
GNU ``bison`` is a parser generator. With ``m4`` and ``flex`` we can now
bootstrap it following https://gitlab.com/giomasce/bison-bootstrap. Its
a 3 stage process:
1. Build bison using a handwritten grammar parser in C.
2. Use bison from previous stage on a simplified bison grammar file.
3. Build bison using original grammar file.
Finally we have a fully functional ``bison`` executable.
grep 2.4
========
GNU ``grep`` is a pattern matching utility. Is is not immediately needed
but will be useful later for autotools.
diffutils 2.7
=============
``diffutils`` is useful for comparing two files. It is not immediately
needed but is required later for autotools.
coreutils 5.0
=============
``coreutils`` is rebuilt against musl. Additional utilities are built
2021-03-24 19:24:59 +00:00
including ``comm``, ``expr``, ``dd``, ``sort``, ``sync``, ``uname`` and
``uniq``. This fixes a variety of issues with existing ``coreutils``.
coreutils 6.10
==============
We build ``date``, ``mktemp`` and ``sha256sum`` from coreutils 6.10 which are
either missing or don't build correctly in 5.0. Other utils are not built at
this stage.
2021-02-28 20:53:20 +00:00
gawk 3.0.4
==========
``gawk`` is the GNU implementation of ``awk``, yet another pattern
matching and data extraction utility. It is also required for autotools.
perl 5.000
==========
Perl is a general purpose programming language that is especially
suitable for text processing. It is essential for autotools build system
because automake and some other tools are written in Perl.
Perl itself is written in C but ships with some pre-generated files that
need perl for processing, namely ``embed.h`` and ``keywords.h``. To
bootstrap Perl we will start with the oldest Perl 5 version which has
the fewest number of pregenerated files. We reimplement two remaining
perl scripts in awk and use our custom makefile instead of Perls
pre-generated Configure script.
At this first step we build ``miniperl`` which is ``perl`` without
support for loading modules.
perl 5.003
==========
We now use ``perl`` from the previous stage to recreate pre-generated
files that are shipped in perl 5.003. But for now we still need to use
handwritten makefile instead of ``./Configure`` script.
perl 5.004_05
=============
Yet another version of perl; the last version buildable with 5.003.
perl 5.005_03
=============
More perl! This is the last version buildable with 5.004. It also
introduces the new pregenerated files ``regnodes.h`` and
``byterun.{h,c}``.
perl 5.6.2
==========
2021-02-27 23:56:00 +00:00
Even more perl. 5.6.2 is the last version buildable with 5.005.
2021-02-21 21:09:47 +00:00
autoconf 2.52
2021-02-27 23:38:12 +00:00
=============
2021-02-21 21:09:47 +00:00
GNU Autoconf is a tool for producing ``configure`` scripts for building, installing and
packaging software on computer systems where a Bourne shell is available.
At this stage we still do not have a working autotools system, so we manually install
``autoconf`` script and replace a few placeholder variables with ``sed``.
2021-03-17 00:24:37 +00:00
Autoconf 2.52 is the newest version of ``autoconf`` that does not need ``perl``, and hence
a bit easier to install.
automake 1.6.3
==============
GNU Automake is a tool for automatically generating Makefile.in files. It is another major
part of GNU Autotools build system and consists of ``aclocal`` and ``automake`` scripts.
2021-03-17 00:24:37 +00:00
We bootstrap it using a 3 stage process:
2021-03-17 00:24:37 +00:00
1. Use ``sed`` to replace a few placeholder variables in ``aclocal.in`` script.
Then we manually install ``aclocal`` script and its dependencies.
2. Patch ``configure.in`` to create ``automake`` file but skip ``Makefile.in`` processing.
Again we manually install ``automake`` script and its dependencies.
3. We now use ``aclocal``, ``autoconf``, and ``automake`` to do a proper build and install.
2021-03-17 00:24:37 +00:00
automake 1.4-p6
===============
This is an older version of GNU Automake. Various versions of GNU Autotools are not fully
compatible, and we will need older ``automake`` to build some older software.
autoconf 2.52
=============
2021-03-17 00:24:37 +00:00
We now properly rebuild ``autoconf`` using ``automake-1.4`` and manually installed ``autoconf``.
2021-02-27 20:07:04 +00:00
autoconf 2.13
=============
2021-03-17 00:24:37 +00:00
An older ``autoconf`` will be necessary to build GNU Binutils.
2021-02-27 22:42:23 +00:00
autoconf 2.12
=============
2021-03-17 00:24:37 +00:00
Yet another old autoconf version that we will need for some parts of GNU Binutils.
2021-02-27 23:29:59 +00:00
2021-03-17 22:17:11 +00:00
autoconf 2.53
=============
We now start bootstrapping newer versions of autoconf. Version 2.53 now uses ``perl``.
In order to build it with ``autoconf-2.52`` we have to patch it a bit.
2021-03-17 22:17:11 +00:00
2021-03-18 00:23:40 +00:00
automake 1.7
============
Automake 1.7 and Autoconf 2.54 depend on each other, so we patch out two offending
autoconf macros to make it build with ``autoconf-2.53``.
autoconf 2.54
=============
More ``autoconf``.
2021-03-18 17:47:12 +00:00
autoconf 2.55
=============
2021-03-18 19:25:50 +00:00
Even newer ``autoconf``. This is the last version of ``autoconf`` that is buildable
with ``automake-1.7``.
automake 1.7.8
==============
Newer ``automake``. This is the latest ``automake`` that is buildable with ``autoconf-2.55``.
2021-03-18 17:47:12 +00:00
2021-03-18 19:33:54 +00:00
autoconf 2.57
=============
Newer ``autoconf``. This time we were able to skip version 2.56.
2021-03-18 19:37:07 +00:00
autoconf 2.59
=============
Again, we managed to skip one version.
2021-03-18 19:53:06 +00:00
automake 1.8.5
==============
We need newer ``automake`` to proceed to newer ``autoconf`` versions. This is the latest
automake version from 1.8 release series.
2021-03-18 19:33:54 +00:00
2021-03-19 15:40:07 +00:00
help2man 1.36.4
===============
``help2man`` automatically generates manpages from programs ``--help`` and ``--version``
outputs. This is not strictly required for bootstrapping but will help us to avoid patching
build process to skip generation of manpages. This is the newest version of ``help2man`` that
does not require Perl 5.8.
2021-03-19 20:00:43 +00:00
autoconf 2.61
=============
Yet another version of ``autoconf``.
2021-03-20 21:23:07 +00:00
automake 1.9.6
==============
Latest GNU Automake from 1.9 series. Slightly annoyingly depends itself but it is easy to patch
to make it buildable with 1.8.5.
2021-03-20 21:23:07 +00:00
2021-03-25 20:39:08 +00:00
automake 1.10.3
===============
GNU Automake from 1.10 series. ``aclocal`` is slightly patched to work
with our ``perl``.
autoconf 2.64
2021-03-25 21:35:43 +00:00
=============
Slightly newer version of GNU Autoconf. At this stage Autoconf is mostly
backwards compatible but newer versions need newer ``automake``.
2023-02-18 03:07:17 +00:00
automake 1.11.2
===============
GNU Automake from 1.11 series. This is not the latest point release as newer ones
need Autoconf 2.68.
libtool 1.4
===========
GNU Libtool is the final part of GNU Autotools. It is a script used to hide away differences
when compiling shared libraries on different platforms.
binutils 2.14
=============
The GNU Binary Utilities, or binutils, are a set of programming tools for creating and
managing binary programs, object files, libraries, profile data, and assembly source code.
In particular we can now use full featured ``ar`` instead of ``tcc -ar``,
the GNU linker ``ld``, which allows us building shared libraries,
and the GNU assembler ``as``.
libtool 2.2.4
=============
Newer version of libtool which is more compatible with modern Autotools.
musl 1.1.24 (v3)
================
We rebuild musl for the third time. This time we use GNU ar rather than ``tcc -ar``,
so we can drop weak symbols patch. Also, we can use GNU as to build assembly source files,
so those assembly files that tcc failed to compile no longer have to be patched.
tcc 0.9.27 (musl v3)
====================
We rebuild tcc against new musl and without a patch to ignore duplicate symbols.
2021-03-20 21:23:07 +00:00
gcc 4.0.4
=========
2021-03-21 00:24:24 +00:00
The GNU Compiler Collection (GCC) is an optimizing compiler produced by the
GNU Project. GCC is a key component of the GNU toolchain and the standard
compiler for most projects related to GNU and the Linux kernel.
2021-03-20 21:23:07 +00:00
2021-03-21 00:24:24 +00:00
Only the C frontend is built at this stage.
2021-03-20 23:49:43 +00:00
At this stage we are not yet able to regenerate top-level ``Makefile.in``
which needs GNU Autogen and hence Guile. Luckily, building GCC without
top-level ``Makefile`` is fairly easy.
findutils 4.2.33
================
GNU Find Utilities can be used to search for files. We are mainly interested
in ``find`` and ``xargs`` that are often used in scripts.
2021-03-20 23:49:43 +00:00
musl 1.2.2
==========
2021-03-21 00:24:24 +00:00
GCC can build the latest as of the time of writing musl version.
We also don't need any of the TCC patches that we used before.
gcc 4.0.4
=========
Rebuild GCC with GCC and also against the latest musl.
2021-03-21 16:10:53 +00:00
2021-06-10 11:27:45 +01:00
util-linux 2.19.1
=================
``util-linux`` contains a number of general system administration utilities.
Most pressingly, we need these for being able to mount disks (for non-chroot
mode, but it is built it in chroot mode anyway because it will likely be useful
later). The latest version is not used because of autotools/GCC
incompatibilities.
kbd-1.15
========
``kbd`` contains ``loadkeys`` which is required for building the Linux kernel.
The 2.x series is not used because it requires particular features of autotools
2021-12-29 19:37:49 +00:00
that we do not have available.
2021-06-10 11:27:45 +01:00
make 3.82
=========
GNU ``make`` is now rebuilt properly using the build system and GCC, which means that
it does not randomly segfault while building the Linux kernel.
2021-06-10 11:27:45 +01:00
kexec-tools 2.0.22
==================
``kexec`` is a utility for the Linux kernel that allows the re-execution of the
Linux kernel without a manual restart from within a running system. It is a
kind of soft-restart. It is only built for non-chroot mode, as we only use it
in non-chroot mode. It is used to go into sysb/sysc.
create_sysb
===========
The next step is not a package, but the creation of the sysb rootfs, containing
all of the scripts for sysb (which merely move to sysc). Again, this is only
done in non-chroot mode, because sysb does not exist in chroot mode.
Linux kernel 4.9.10
===================
A lot going on here. This is the first (and currently only) time the Linux kernel
is built. Firstly, Linux kernel version 4.9.x is used because newer versions
require much more stringent requirements on the make, GCC, binutils versions.
However, the docs are also wrong, as the latest of the 4.9.x series does not
work with our version of binutils. However, a much earlier 4.9.10 does
(selected arbitarily, could go newer but did not test), with a small amount
of patching. This is also modern enough for most hardware and to cause few
problems with software built in sysc. Secondly, the linux-libre scripts are used
2022-05-04 01:09:24 +01:00
to deblob the kernel. Every other pregenerated file is appended with ``_shipped``
so we use a ``find`` command to remove those, which are automatically regenerated.
The kernel config was originally taken from Void Linux, and was then modified
for the requirements of live-bootstrap, including compiler features, drivers,
2022-05-04 01:09:24 +01:00
and removing modules. Modules are unused. They are difficult to transfer to
subsequent systems, and we do not have ``modprobe``. Lastly,
the initramfs of sysb is generated in this stage, using ``gen_init_cpio`` within
the Linux kernel tree. This avoids the compilation of ``cpio`` as well.
go_sysb
=======
This is the last step of sysa, run for non-chroot mode. It uses kexec to load
the new Linux kernel into RAM and execute it, moving into sysb.
2022-05-04 01:09:24 +01:00
In chroot, sysb is skipped, and data is transferred directly to sysc and
chrooted into.
sysb
====
sysb is purely a transition to sysc, allowing binaries from sysa to get onto a
2022-05-04 01:09:24 +01:00
disk (as sysa does not necessarily have hard disk support in the kernel).
It populates device nodes, mounts sysc, copies over data, and executes sysc.
2021-03-21 16:10:53 +00:00
bash 5.1
========
Up to this point, our build of ``bash`` could run scripts but could not be used
interactively. This new version of ``bash`` compiles without any patches,
provides new features, and is built with GNU readline support so it can be used
as an interactive shell. ``autoconf-2.61`` is used to regenerate the configure
script and ``bison`` is used to recreate some included generated files.
2021-03-25 21:47:57 +00:00
xz 5.0.5
========
XZ Utils is a set of free software command-line lossless data compressors,
including lzma and xz. In most cases, xz achieves higher compression rates
than alternatives like gzip and bzip2.
2021-03-28 23:18:33 +01:00
2023-02-14 05:57:22 +00:00
file 5.44
=========
file is a utility that is used to get information about files based upon their
magic.
libtool 2.4.7
=============
A modern version of libtool with better compatiblitiy with newer versions of GNU
Autotools.
2021-03-28 23:36:27 +01:00
autoconf 2.69
=============
2021-10-09 22:21:23 +01:00
This is a much newer version of GNU Autoconf.
2021-03-31 18:21:07 +01:00
2021-04-01 00:28:30 +01:00
automake 1.15.1
===============
GNU Automake from 1.15 series. This is the last version that runs on Perl 5.6.
2021-04-03 18:40:48 +01:00
2021-04-02 17:59:49 +01:00
tar 1.34
========
Newer tar has better support for decompressing .tar.bz2 and .tar.xz archives.
It also deals better with modern tar archives with extra metadata.
coreutils 8.32
==============
We build the latest available coreutils 8.32 which adds needed options to make
results of build metadata reproducible. For example, timestamps are changed with
``touch --no-dereference``.
2021-06-29 22:20:15 +01:00
pkg-config 0.29.2
=================
pkg-config is a helper tool that helps to insert compile and link time flags.
2021-04-28 07:15:58 +01:00
make 4.2.1
==========
A newer version of make built using autotools is much more reliable and is
compiled using a modern C compiler and C library. This removes a couple of
segfaults encountered later in the process and allows more modern make features
to be used. We do not go for the latest because of the use of automake 1.16
which we do not have yet.
2021-04-03 18:40:48 +01:00
gmp 6.2.1
=========
GNU Multiple Precision Arithmetic Library (GMP) is a free library for
arbitrary-precision arithmetic, operating on signed integers, rational numbers,
and floating-point numbers.
GMP is required by newer versions of GCC and Guile.
2021-04-17 20:16:25 +01:00
autoconf-archive 2021.02.19
===========================
The GNU Autoconf Archive is a collection of Autoconf macros that are used by
2021-04-17 23:54:19 +01:00
various projects and in particular GNU MPFR.
mpfr 4.1.0
==========
The GNU Multiple Precision Floating-Point Reliable Library (GNU MPFR) is a library
for arbitrary-precision binary floating-point computation with correct rounding,
based on GNU Multi-Precision Library.
2021-04-18 00:42:47 +01:00
mpc 3.2.1
=========
GNU MPC is a library for multiprecision complex arithmetic with exact rounding based
on GNU MPFR.
2021-05-24 00:45:07 +01:00
flex 2.5.33
===========
An older version of flex is required for bison 2.3. We cannot use 2.5.11 that
was compiled much earlier, as it does not produce reproducible output when
building bison 2.3.
bison 2.3
=========
This is an older version of bison required for the bison files in perl 5.10.1.
We backwards-bootstrap this from 3.4.1, using 3.4.1 to compile the bison files
in 2.3. This parser works sufficiently well for perl 5.10.5.
2021-04-28 07:20:41 +01:00
bison 3.4.2
===========
Bison 3.4.1 is buggy and segfaults when perl 5.32.1 is built. This is probably
because it was built with a hand-written makefile. We do not build the latest
bison because perl 5.32.1 requires bison <= 3.4.2.
perl 5.10.1
===========
Perl 5.10.1 is an intermediate version used before Perl 5.32. We require this
version as it adds a couple of modules into lib/ required to regenerate files in
Perl 5.32. We still use the Makefile instead of the metaconfig strategy, as
metaconfig history becomes poor more than a few years back.
dist 3.5-236
============
dist is perl's package used for generating Perl's Configure (which is written in
Perl itself). We 'compile' (aka generate) metaconfig and manifake only from dist.
We do not use dist's build system because it itself uses dist.
2021-04-28 07:23:34 +01:00
perl 5.32.1
===========
We finally compile a full version of Perl using Configure. This includes all base
extensions required and is the latest version of Perl. We are now basically able
to run any Perl application we want.
2021-05-26 00:48:05 +01:00
2022-05-04 01:09:24 +01:00
libarchive 3.5.2
================
``libarchive`` is a C library used to read and write archives.
openssl 1.1.1l
==============
OpenSSL is a C library for secure communications/cryptography. We do not
strictly use any of the networking functions of this library but it is a hard dependency
of XBPS.
zlib 1.2.13
2022-05-04 01:09:24 +01:00
===========
zlib is a software library used for data compression and implements an abstraction of
DEFLATE algorithm that is also used in ``gzip``.
2021-05-26 00:48:05 +01:00
automake 1.16.3
===============
GNU Automake from 1.16 series that required newer Perl.
2021-05-27 21:54:41 +01:00
2021-10-09 22:21:23 +01:00
autoconf 2.71
=============
GNU Autoconf 2.71 is even newer version of autoconf. It does not build with miniperl,
so we postponed it until full perl was built.
2021-05-27 21:54:41 +01:00
patch 2.7.6
===========
Our old patch was built with manual makefile and used mes libc.
This is a newer version which we need in order to import gnulib into gettext.
2021-05-27 21:58:10 +01:00
gettext 0.21
============
GNU Gettext is an internationalization and localization system used for writing
multilingual programs.
2021-05-29 01:05:21 +01:00
texinfo 6.7
===========
Texinfo is a typesetting syntax used for generating documentation. We can now use
``makeinfo`` script to convert ``.texi`` files into ``.info`` documentation format.
2021-06-27 16:10:10 +01:00
2021-07-01 19:59:10 +01:00
gcc 4.7.4
=========
2021-07-07 20:10:38 +01:00
GCC 4.7.4 is the last version written in C. This time we build both C and C++ backends.
C++ backend has some dependency on ``gperf`` which is written in C++. Fortunately, it is
easy to patch it out and resulting ``g++`` compiler is capable of building ``gperf``.
We also add in two patchsets to the compiler;
* one to add support for musl shared library support
* one providing a few compiler flags/features that are required later to build GCC 10
2021-07-01 19:59:10 +01:00
2022-09-04 09:12:21 +01:00
binutils 2.38
=============
This version of binutils provides a more comprehensive set of programming tools for
creating and managing binary programs. It also includes modern versions of the ``ld``
linker, the ``as`` assembler and the ``ar`` program.
2021-07-08 01:42:56 +01:00
gperf 3.1
=========
``gperf`` is a perfect hash function generator (hash function is injective).
2021-06-27 16:10:10 +01:00
libunistring 0.9.10
===================
Library for manipulating Unicode and C strings according to Unicode standard. This
is a dependency of GNU Guile.
2021-07-01 21:59:05 +01:00
libffi 3.3
==========
The libffi library provides a portable, high level programming interface to various
calling conventions.
2021-07-01 23:14:17 +01:00
libatomic_ops 7.6.10
====================
``libatomic_ops`` provides semi-portable access to hardware-provided atomic memory
update operations on a number of architectures.
2021-07-02 01:09:54 +01:00
boehm-gc 8.0.4
==============
The Boehm-Demers-Weiser conservative garbage collector can be used as a garbage
collecting replacement for C malloc or C++ new.
2021-07-03 17:38:05 +01:00
guile 3.0.7
===========
GNU Ubiquitous Intelligent Language for Extensions (GNU Guile) is the preferred
extension language system for the GNU Project and features an implementation
of the programming language Scheme.
We use ``guile-psyntax-bootstrapping`` project to bootstrap Guile's ``psyntax.pp``
without relying on pre-expanded code.
2022-09-16 22:40:00 +01:00
which 2.21
==========
``which`` shows the full path of (shell) commands. It mostly duplicates
bash built-in ``command -v`` but some scripts call ``which`` instead.
In particular, ``autogen`` scripts use it.
2022-09-19 21:45:14 +01:00
grep 3.7
========
Newer ``grep`` will be needed to bootstrap ``autogen``.
2022-09-23 22:03:32 +01:00
sed 4.8
=======
Earlier ``sed`` was built with manual makefile with most features compiled out.
Build a newer ``sed`` using GNU Autotools build system. In particular this will let
sed keep executable bit on after in place editing.
2022-09-23 22:03:59 +01:00
autogen 5.18.16
===============
GNU Autogen is a tool designed to simplify the creation and maintenance of programs that contain large amounts of repetitious text. Unfortunately, the source is full of pregenerated files that require ``autogen`` to rebuild.
We will use `gnu-autogen-bootstrapping <https://github.com/schierlm/gnu-autogen-bootstrapping>`_ project to rebuild
those and create slightly crippled ``autogen`` that is then able to build full-featured version.
2022-10-06 00:57:44 +01:00
musl 1.2.3
==========
With GCC and binutils supporting a musl-based toolchain natively, musl itself is rebuilt
with support for dynamic linking.
2023-01-16 22:23:14 +00:00
python 2.0.1
2023-01-16 22:23:14 +00:00
============
Everything is in place to bootstrap the useful programming language/utility
Python. While Python is largely written in C, many parts of the codebase are
generated from Python scripts, which only increases as Python matured over time.
We begin with Python 2.0.1, which has minimal generated code, most of which can
be removed. Lib/{keyword,token,symbol} scripts are rewritten in C and used to
regenerate parts of the standard library. Unicode support and sre (regex)
support is stripped out.
Using the stripped-down first version of Python 2.0.1, Python 2.0.1 is rebuilt,
including Unicode and regex support (required for future Python builds). The
first version is insufficient to run the Lib/{keyword,token,symbol} scripts, so
those continue to use the C versions.
Precompiled Python code at this point is highly unreproducible, so it is
deleted (JIT compiled instead). This makes Python itself slower, but this is of
little consequence.
python 2.3.7
2023-01-16 22:23:14 +00:00
============
Python 2.0.1 is sufficient to build Python 2.3.7.
Differences to 2.0.1:
* The new "ast" module, performing parsing of Python, is generated from a
parsing specification using Python code.
* 2.0.1 is insufficient to run 2.3.7's unicode regeneration, so Unicode
support is again stripped out.
Python 2.3.7 is then rebuilt to include Unicode support.
python 2.5.6
2023-01-16 22:23:14 +00:00
============
Python 2.3.7 is sufficient to build Python 2.5.6, with a few minimal changes to
language constructs in scripts. This is the last 2.x version we build.
Differences to 2.3.7 are very minimal.
python 3.1.5
2023-01-16 22:23:14 +00:00
============
Python 2.5.6 is new enough to be able to build Python 3.1.5, allowing us to move
into the modern 3.x series of Python. Various patching is required, as some
scripts in the tree are still Python 2 while others are Python 3. We have to
convert the Python 3 ones back to Python 2 to be able to use Python 2.5.6.
Differences to 2.5.6:
* An include cycle when a distributed file is removed arises, we have to jump
through some hoops to make this work.
* At the second pass of building, various charset encodings can be regenerated &
used in the standard library (required in future Python 3.x).
* The new ssl Python library is disabled due to our OpenSSL version being too
new.
Python 3.1.5 is rebuilt, using Python 3 for the Python 3 scripts in the tree.
python 3.3.7
2023-01-16 22:23:14 +00:00
============
Python 3.1.5 is sufficient to build Python 3.3.7 (rapid language change = small
jumps).
Differences to 3.1.5:
* The ssl Python library can now be re-enabled, and ``_ssl_data.h`` regenerated.
python 3.4.10
2023-01-16 22:23:14 +00:00
=============
Python 3.3.7 is sufficient to build Python 3.4.10.
Differences to 3.3.7:
* The clinic tool has been introduced, which unifies documentation with code.
Clinic creates many generated files. We run the clinic tool across all files
using clinic.
* The ssl library breaks in much more ugly ways than before, but unlike previous
versions, it passes over this error silently.
python 3.8.16
2023-01-16 22:23:14 +00:00
=============
Python 3.4.10 is sufficient to build Python 3.8.16.
Differences to 3.4.10:
* The build system has been significantly revamped (coming in line with modern
standards).
* Many of our previous regenerations can be replaced with one ``make regen-all``
invocation.
* The stringprep Python module, previously deleted, is now required, so it is
regenerated.
python 3.11.1
2023-01-16 22:23:14 +00:00
=============
The newest version of Python, Python 3.11.1 can now be built.
Differences to 3.8.16:
* Unfortunately, the build system has regressed slightly. We must choose the
order to perform regenerations in the Makefile ourselves, as some
regenerations use other regenerations, but the Makefile does not include them
as dependencies.
* The concept of "frozen" modules has been introduced, adding a layer of
complexity to regeneration.
* ``stdlib_module_names.h`` is a new file that must be built using data from a
current Python binary. To achieve this, a dummy ``stdlib_module_names.h`` is used
for the build, then ``stdlib_module_names.h`` is created, and Python is
rebuilt using the proper ``stdlib_module_names.h``. Unfortunately this
greatly increases the time taken to build Python, but it is not trivial to
work around.
* A new generated script ``Lib/re/_casefix.py`` is introduced.
* The ssl module, now unbroken, can be built again.
* Very recent Python versions allow for the use of ``SOURCE_DATE_EPOCH`` to
remove determinism from precompiled Python libraries (``.pyc``). Finally, we
can re-enable compiling of Python modules.
GCC 10.4.0
==========
GCC 10.x series is the last version of GCC that is able to be built with the
C/C++ standards available in GCC 4.7. Instead of manually configuring & compiling
every subdirectory, since we now have ``autogen`` available we are able to use
the top-level configure to build the project. We do not use GCC's bootstrap mode,
where GCC is recompiled with itself after being built, since we're just going
to use this GCC to compile GCC 12, it adds build time for little benefit.