[Notes] [Git][BuildStream/buildstream][valentindavid/cache_server_fill_up] 25 commits: _project.py: Validate nodes early in Project._load

Valentin David pushed to branch valentindavid/cache_server_fill_up at BuildStream / buildstream

26 changed files:

Changes:

  • .gitlab-ci.yml
    @@ -79,32 +79,46 @@ source_dist:
       - cd ../..
       - mkdir -p coverage-linux/
       - cp dist/buildstream/.coverage coverage-linux/coverage."${CI_JOB_NAME}"
    -  except:
    -  - schedules
       artifacts:
         paths:
         - coverage-linux/

     tests-debian-9:
    -  image: buildstream/testsuite-debian:9-master-119-552f5fc6
    +  image: buildstream/testsuite-debian:9-master-123-7ce6581b
       <<: *linux-tests
    +  except:
    +  - schedules

     tests-fedora-27:
    -  image: buildstream/testsuite-fedora:27-master-119-552f5fc6
    +  image: buildstream/testsuite-fedora:27-master-123-7ce6581b
       <<: *linux-tests
    +  except:
    +  - schedules

     tests-fedora-28:
    -  image: buildstream/testsuite-fedora:28-master-119-552f5fc6
    +  image: buildstream/testsuite-fedora:28-master-123-7ce6581b
       <<: *linux-tests
    +  except:
    +  - schedules

     tests-ubuntu-18.04:
    -  image: buildstream/testsuite-ubuntu:18.04-master-119-552f5fc6
    +  image: buildstream/testsuite-ubuntu:18.04-master-123-7ce6581b
       <<: *linux-tests
    +  except:
    +  - schedules
    +
    +overnight-fedora-28-aarch64:
    +  image: buildstream/testsuite-fedora:aarch64-28-master-123-7ce6581b
    +  tags:
    +    - aarch64
    +  <<: *linux-tests
    +  only:
    +  - schedules

     tests-unix:
       # Use fedora here, to a) run a test on fedora and b) ensure that we
       # can get rid of ostree - this is not possible with debian-8
    -  image: buildstream/testsuite-fedora:27-master-119-552f5fc6
    +  image: buildstream/testsuite-fedora:27-master-123-7ce6581b
       stage: test
       variables:
         BST_FORCE_BACKEND: "unix"
    

  • buildstream/_artifactcache/cascache.py
    @@ -25,6 +25,7 @@ import stat
     import tempfile
     import uuid
     import errno
    +import contextlib
     from urllib.parse import urlparse

     import grpc
    @@ -43,6 +44,13 @@ from .._exceptions import CASError
     _MAX_PAYLOAD_BYTES = 1024 * 1024


    +class BlobNotFound(CASError):
    +
    +    def __init__(self, blob, msg):
    +        self.blob = blob
    +        super().__init__(msg)
    +
    +
     # A CASCache manages a CAS repository as specified in the Remote Execution API.
     #
     # Args:
    @@ -219,6 +227,8 @@ class CASCache():
                     raise CASError("Failed to pull ref {}: {}".format(ref, e)) from e
                 else:
                     return False
    +        except BlobNotFound as e:
    +            return False

         # pull_tree():
         #
    @@ -391,13 +401,14 @@ class CASCache():
         #     digest (Digest): An optional Digest object to populate
         #     path (str): Path to file to add
         #     buffer (bytes): Byte buffer to add
    +    #     link_directly (bool): Whether file given by path can be linked
         #
         # Returns:
         #     (Digest): The digest of the added object
         #
         # Either `path` or `buffer` must be passed, but not both.
         #
    -    def add_object(self, *, digest=None, path=None, buffer=None):
    +    def add_object(self, *, digest=None, path=None, buffer=None, link_directly=False):
             # Exactly one of the two parameters has to be specified
             assert (path is None) != (buffer is None)

    @@ -407,28 +418,34 @@ class CASCache():
             try:
                 h = hashlib.sha256()
                 # Always write out new file to avoid corruption if input file is modified
    -            with tempfile.NamedTemporaryFile(dir=self.tmpdir) as out:
    -                # Set mode bits to 0644
    -                os.chmod(out.name, stat.S_IRUSR | stat.S_IWUSR | stat.S_IRGRP | stat.S_IROTH)
    -
    -                if path:
    -                    with open(path, 'rb') as f:
    -                        for chunk in iter(lambda: f.read(4096), b""):
    -                            h.update(chunk)
    -                            out.write(chunk)
    +            with contextlib.ExitStack() as stack:
    +                if path is not None and link_directly:
    +                    tmp = stack.enter_context(open(path, 'rb'))
    +                    for chunk in iter(lambda: tmp.read(4096), b""):
    +                        h.update(chunk)
                     else:
    -                    h.update(buffer)
    -                    out.write(buffer)
    +                    tmp = stack.enter_context(tempfile.NamedTemporaryFile(dir=self.tmpdir))
    +                    # Set mode bits to 0644
    +                    os.chmod(tmp.name, stat.S_IRUSR | stat.S_IWUSR | stat.S_IRGRP | stat.S_IROTH)

    -                out.flush()
    +                    if path:
    +                        with open(path, 'rb') as f:
    +                            for chunk in iter(lambda: f.read(4096), b""):
    +                                h.update(chunk)
    +                                tmp.write(chunk)
    +                    else:
    +                        h.update(buffer)
    +                        tmp.write(buffer)
    +
    +                    tmp.flush()

                     digest.hash = h.hexdigest()
    -                digest.size_bytes = os.fstat(out.fileno()).st_size
    +                digest.size_bytes = os.fstat(tmp.fileno()).st_size

                     # Place file at final location
                     objpath = self.objpath(digest)
                     os.makedirs(os.path.dirname(objpath), exist_ok=True)
    -                os.link(out.name, objpath)
    +                os.link(tmp.name, objpath)

             except FileExistsError as e:
                 # We can ignore the failed link() if the object is already in the repo.
    @@ -526,6 +543,41 @@ class CASCache():
             # first ref of this list will be the file modified earliest.
             return [ref for _, ref in sorted(zip(mtimes, refs))]

    +    # list_objects():
    +    #
    +    # List cached objects in Least Recently Modified (LRM) order.
    +    #
    +    # Returns:
    +    #     (list) - A list of objects and timestamps in LRM order
    +    #
    +    def list_objects(self):
    +        objs = []
    +        mtimes = []
    +
    +        for root, _, files in os.walk(os.path.join(self.casdir, 'objects')):
    +            for filename in files:
    +                obj_path = os.path.join(root, filename)
    +                try:
    +                    mtimes.append(os.path.getmtime(obj_path))
    +                except FileNotFoundError:
    +                    pass
    +                else:
    +                    objs.append(obj_path)
    +
    +        # NOTE: Sorted will sort from earliest to latest, thus the
    +        # first element of this list will be the file modified earliest.
    +        return sorted(zip(mtimes, objs))
    +
    +    def clean_up_refs_until(self, time):
    +        ref_heads = os.path.join(self.casdir, 'refs', 'heads')
    +
    +        for root, _, files in os.walk(ref_heads):
    +            for filename in files:
    +                ref_path = os.path.join(root, filename)
    +                # Obtain the mtime (the time a file was last modified)
    +                if os.path.getmtime(ref_path) < time:
    +                    os.unlink(ref_path)
    +
         # remove():
         #
         # Removes the given symbolic ref from the repo.
    @@ -559,7 +611,12 @@ class CASCache():
         #
         # Prune unreachable objects from the repo.
         #
    -    def prune(self):
    +    # Args:
    +    #    keep_after (int|None): timestamp after which unreachable objects
    +    #                           are kept. None if no unreachable object
    +    #                           should be kept.
    +    #
    +    def prune(self, keep_after=None):
             ref_heads = os.path.join(self.casdir, 'refs', 'heads')

             pruned = 0
    @@ -580,11 +637,19 @@ class CASCache():
                     objhash = os.path.basename(root) + filename
                     if objhash not in reachable:
                         obj_path = os.path.join(root, filename)
    +                    if keep_after:
    +                        st = os.stat(obj_path)
    +                        if st.st_mtime >= keep_after:
    +                            continue
                         pruned += os.stat(obj_path).st_size
                         os.unlink(obj_path)

             return pruned

    +    def update_tree_mtime(self, tree):
    +        reachable = set()
    +        self._reachable_refs_dir(reachable, tree, update_mtime=True)
    +
         ################################################
         #             Local Private Methods            #
         ################################################
    @@ -729,10 +794,13 @@ class CASCache():
                 a += 1
                 b += 1

    -    def _reachable_refs_dir(self, reachable, tree):
    +    def _reachable_refs_dir(self, reachable, tree, update_mtime=False):
             if tree.hash in reachable:
                 return

    +        if update_mtime:
    +            os.utime(self.objpath(tree))
    +
             reachable.add(tree.hash)

             directory = remote_execution_pb2.Directory()
    @@ -741,10 +809,12 @@ class CASCache():
                 directory.ParseFromString(f.read())

             for filenode in directory.files:
    +            if update_mtime:
    +                os.utime(self.objpath(filenode.digest))
                 reachable.add(filenode.digest.hash)

             for dirnode in directory.directories:
    -            self._reachable_refs_dir(reachable, dirnode.digest)
    +            self._reachable_refs_dir(reachable, dirnode.digest, update_mtime=update_mtime)

         def _required_blobs(self, directory_digest):
             # parse directory, and recursively add blobs
    @@ -798,7 +868,7 @@ class CASCache():
             with tempfile.NamedTemporaryFile(dir=self.tmpdir) as f:
                 self._fetch_blob(remote, digest, f)

    -            added_digest = self.add_object(path=f.name)
    +            added_digest = self.add_object(path=f.name, link_directly=True)
                 assert added_digest.hash == digest.hash

             return objpath
    @@ -809,7 +879,7 @@ class CASCache():
                     f.write(data)
                     f.flush()

    -                added_digest = self.add_object(path=f.name)
    +                added_digest = self.add_object(path=f.name, link_directly=True)
                     assert added_digest.hash == digest.hash

         # Helper function for _fetch_directory().
    @@ -1113,6 +1183,9 @@ class _CASBatchRead():
             batch_response = self._remote.cas.BatchReadBlobs(self._request)

             for response in batch_response.responses:
    +            if response.status.code == code_pb2.NOT_FOUND:
    +                raise BlobNotFound(response.digest.hash, "Failed to download blob {}: {}".format(
    +                    response.digest.hash, response.status.code))
                 if response.status.code != code_pb2.OK:
                     raise CASError("Failed to download blob {}: {}".format(
                         response.digest.hash, response.status.code))

  • buildstream/_artifactcache/casserver.py
    @@ -24,6 +24,9 @@ import signal
     import sys
     import tempfile
     import uuid
    +import errno
    +import ctypes
    +import threading

     import click
     import grpc
    @@ -31,6 +34,7 @@ import grpc
     from .._protos.build.bazel.remote.execution.v2 import remote_execution_pb2, remote_execution_pb2_grpc
     from .._protos.google.bytestream import bytestream_pb2, bytestream_pb2_grpc
     from .._protos.buildstream.v2 import buildstream_pb2, buildstream_pb2_grpc
    +from .._protos.google.rpc import code_pb2

     from .._exceptions import CASError

    @@ -41,6 +45,10 @@ from .cascache import CASCache
     # Limit payload to 1 MiB to leave sufficient headroom for metadata.
     _MAX_PAYLOAD_BYTES = 1024 * 1024

    +# The minimum age in seconds for objects before they can be cleaned
    +# up.
    +_OBJECT_MIN_AGE = 6 * 60 * 60
    +

     # Trying to push an artifact that is too large
     class ArtifactTooLargeException(Exception):
    @@ -55,18 +63,22 @@ class ArtifactTooLargeException(Exception):
     #     repo (str): Path to CAS repository
     #     enable_push (bool): Whether to allow blob uploads and artifact updates
     #
    -def create_server(repo, *, enable_push):
    +def create_server(repo, *, enable_push,
    +                  max_head_size=int(10e9),
    +                  min_head_size=int(2e9)):
         cas = CASCache(os.path.abspath(repo))

         # Use max_workers default from Python 3.5+
         max_workers = (os.cpu_count() or 1) * 5
         server = grpc.server(futures.ThreadPoolExecutor(max_workers))

    +    cache_cleaner = _CacheCleaner(cas, max_head_size, min_head_size)
    +
         bytestream_pb2_grpc.add_ByteStreamServicer_to_server(
    -        _ByteStreamServicer(cas, enable_push=enable_push), server)
    +        _ByteStreamServicer(cas, cache_cleaner, enable_push=enable_push), server)

         remote_execution_pb2_grpc.add_ContentAddressableStorageServicer_to_server(
    -        _ContentAddressableStorageServicer(cas, enable_push=enable_push), server)
    +        _ContentAddressableStorageServicer(cas, cache_cleaner, enable_push=enable_push), server)

         remote_execution_pb2_grpc.add_CapabilitiesServicer_to_server(
             _CapabilitiesServicer(), server)
    @@ -84,9 +96,19 @@ def create_server(repo, *, enable_push):
     @click.option('--client-certs', help="Public client certificates for TLS (PEM-encoded)")
     @click.option('--enable-push', default=False, is_flag=True,
                   help="Allow clients to upload blobs and update artifact cache")
    +@click.option('--head-room-min', type=click.INT,
    +              help="Disk head room minimum in bytes",
    +              default=2e9)
    +@click.option('--head-room-max', type=click.INT,
    +              help="Disk head room maximum in bytes",
    +              default=10e9)
     @click.argument('repo')
    -def server_main(repo, port, server_key, server_cert, client_certs, enable_push):
    -    server = create_server(repo, enable_push=enable_push)
    +def server_main(repo, port, server_key, server_cert, client_certs, enable_push,
    +                head_room_min, head_room_max):
    +    server = create_server(repo,
    +                           max_head_size=head_room_max,
    +                           min_head_size=head_room_min,
    +                           enable_push=enable_push)

         use_tls = bool(server_key)

    @@ -127,11 +149,43 @@ def server_main(repo, port, server_key, server_cert, client_certs, enable_push):
             server.stop(0)


    +class _FallocateCall:
    +
    +    FALLOC_FL_KEEP_SIZE = 1
    +    FALLOC_FL_PUNCH_HOLE = 2
    +    FALLOC_FL_NO_HIDE_STALE = 4
    +    FALLOC_FL_COLLAPSE_RANGE = 8
    +    FALLOC_FL_ZERO_RANGE = 16
    +    FALLOC_FL_INSERT_RANGE = 32
    +    FALLOC_FL_UNSHARE_RANGE = 64
    +
    +    def __init__(self):
    +        self.libc = ctypes.CDLL("libc.so.6", use_errno=True)
    +        try:
    +            self.fallocate64 = self.libc.fallocate64
    +        except AttributeError:
    +            self.fallocate = self.libc.fallocate
    +
    +    def __call__(self, fd, mode, offset, length):
    +        if hasattr(self, 'fallocate64'):
    +            ret = self.fallocate64(ctypes.c_int(fd), ctypes.c_int(mode),
    +                                   ctypes.c_int64(offset), ctypes.c_int64(length))
    +        else:
    +            ret = self.fallocate(ctypes.c_int(fd), ctypes.c_int(mode),
    +                                 ctypes.c_int(offset), ctypes.c_int(length))
    +        if ret == -1:
    +            err = ctypes.get_errno()
    +            raise OSError(err, os.strerror(err))
    +        return ret
    +
    +
     class _ByteStreamServicer(bytestream_pb2_grpc.ByteStreamServicer):
    -    def __init__(self, cas, *, enable_push):
    +    def __init__(self, cas, cache_cleaner, *, enable_push):
             super().__init__()
             self.cas = cas
             self.enable_push = enable_push
    +        self.fallocate = _FallocateCall()
    +        self.cache_cleaner = cache_cleaner

         def Read(self, request, context):
             resource_name = request.resource_name
    @@ -189,25 +243,44 @@ class _ByteStreamServicer(bytestream_pb2_grpc.ByteStreamServicer):
                             context.set_code(grpc.StatusCode.NOT_FOUND)
                             return response

    -                    try:
    -                        _clean_up_cache(self.cas, client_digest.size_bytes)
    -                    except ArtifactTooLargeException as e:
    -                        context.set_code(grpc.StatusCode.RESOURCE_EXHAUSTED)
    -                        context.set_details(str(e))
    -                        return response
    +                    while True:
    +                        if client_digest.size_bytes == 0:
    +                            break
    +                        try:
    +                            self.cache_cleaner.clean_up(client_digest.size_bytes)
    +                        except ArtifactTooLargeException as e:
    +                            context.set_code(grpc.StatusCode.RESOURCE_EXHAUSTED)
    +                            context.set_details(str(e))
    +                            return response
    +
    +                        try:
    +                            self.fallocate(out.fileno(), 0, 0, client_digest.size_bytes)
    +                            break
    +                        except OSError as e:
    +                            # Multiple uploads can happen at the same time
    +                            if e.errno != errno.ENOSPC:
    +                                raise
    +
                     elif request.resource_name:
                         # If it is set on subsequent calls, it **must** match the value of the first request.
                         if request.resource_name != resource_name:
                             context.set_code(grpc.StatusCode.FAILED_PRECONDITION)
                             return response
    +
    +                if (offset + len(request.data)) > client_digest.size_bytes:
    +                    context.set_code(grpc.StatusCode.FAILED_PRECONDITION)
    +                    return response
    +
                     out.write(request.data)
    +
                     offset += len(request.data)
    +
                     if request.finish_write:
                         if client_digest.size_bytes != offset:
                             context.set_code(grpc.StatusCode.FAILED_PRECONDITION)
                             return response
                         out.flush()
    -                    digest = self.cas.add_object(path=out.name)
    +                    digest = self.cas.add_object(path=out.name, link_directly=True)
                         if digest.hash != client_digest.hash:
                             context.set_code(grpc.StatusCode.FAILED_PRECONDITION)
                             return response
    @@ -220,18 +293,26 @@ class _ByteStreamServicer(bytestream_pb2_grpc.ByteStreamServicer):


     class _ContentAddressableStorageServicer(remote_execution_pb2_grpc.ContentAddressableStorageServicer):
    -    def __init__(self, cas, *, enable_push):
    +    def __init__(self, cas, cache_cleaner, *, enable_push):
             super().__init__()
             self.cas = cas
             self.enable_push = enable_push
    +        self.cache_cleaner = cache_cleaner

         def FindMissingBlobs(self, request, context):
             response = remote_execution_pb2.FindMissingBlobsResponse()
             for digest in request.blob_digests:
    -            if not _has_object(self.cas, digest):
    -                d = response.missing_blob_digests.add()
    -                d.hash = digest.hash
    -                d.size_bytes = digest.size_bytes
    +            objpath = self.cas.objpath(digest)
    +            try:
    +                os.utime(objpath)
    +            except OSError as e:
    +                if e.errno != errno.ENOENT:
    +                    raise
    +                else:
    +                    d = response.missing_blob_digests.add()
    +                    d.hash = digest.hash
    +                    d.size_bytes = digest.size_bytes
    +
             return response

         def BatchReadBlobs(self, request, context):
    @@ -250,12 +331,12 @@ class _ContentAddressableStorageServicer(remote_execution_pb2_grpc.ContentAddressableStorageServicer):
                 try:
                     with open(self.cas.objpath(digest), 'rb') as f:
                         if os.fstat(f.fileno()).st_size != digest.size_bytes:
    -                        blob_response.status.code = grpc.StatusCode.NOT_FOUND
    +                        blob_response.status.code = code_pb2.NOT_FOUND
                             continue

                         blob_response.data = f.read(digest.size_bytes)
                 except FileNotFoundError:
    -                blob_response.status.code = grpc.StatusCode.NOT_FOUND
    +                blob_response.status.code = code_pb2.NOT_FOUND

             return response

    @@ -285,7 +366,7 @@ class _ContentAddressableStorageServicer(remote_execution_pb2_grpc.ContentAddressableStorageServicer):
                     continue

                 try:
    -                _clean_up_cache(self.cas, digest.size_bytes)
    +                self.cache_cleaner.clean_up(digest.size_bytes)

                     with tempfile.NamedTemporaryFile(dir=self.cas.tmpdir) as out:
                         out.write(blob_request.data)
    @@ -328,6 +409,12 @@ class _ReferenceStorageServicer(buildstream_pb2_grpc.ReferenceStorageServicer):

             try:
                 tree = self.cas.resolve_ref(request.key, update_mtime=True)
    +            try:
    +                self.cas.update_tree_mtime(tree)
    +            except FileNotFoundError:
    +                self.cas.remove(request.key, defer_prune=True)
    +                context.set_code(grpc.StatusCode.NOT_FOUND)
    +                return response

                 response.digest.hash = tree.hash
                 response.digest.size_bytes = tree.size_bytes
    @@ -400,60 +487,80 @@ def _digest_from_upload_resource_name(resource_name):
             return None


    -def _has_object(cas, digest):
    -    objpath = cas.objpath(digest)
    -    return os.path.exists(objpath)
    +class _CacheCleaner:

    +    __cleanup_cache_lock = threading.Lock()

    -# _clean_up_cache()
    -#
    -# Keep removing Least Recently Pushed (LRP) artifacts in a cache until there
    -# is enough space for the incoming artifact
    -#
    -# Args:
    -#   cas: CASCache object
    -#   object_size: The size of the object being received in bytes
    -#
    -# Returns:
    -#   int: The total bytes removed on the filesystem
    -#
    -def _clean_up_cache(cas, object_size):
    -    # Determine the available disk space, in bytes, of the file system
    -    # which mounts the repo
    -    stats = os.statvfs(cas.casdir)
    -    buffer_ = int(2e9)                # Add a 2 GB buffer
    -    free_disk_space = (stats.f_bfree * stats.f_bsize) - buffer_
    -    total_disk_space = (stats.f_blocks * stats.f_bsize) - buffer_
    -
    -    if object_size > total_disk_space:
    -        raise ArtifactTooLargeException("Artifact of size: {} is too large for "
    -                                        "the filesystem which mounts the remote "
    -                                        "cache".format(object_size))
    -
    -    if object_size <= free_disk_space:
    -        # No need to clean up
    -        return 0
    -
    -    # obtain a list of LRP artifacts
    -    LRP_artifacts = cas.list_refs()
    -
    -    removed_size = 0  # in bytes
    -    while object_size - removed_size > free_disk_space:
    -        try:
    -            to_remove = LRP_artifacts.pop(0)  # The first element in the list is the LRP artifact
    -        except IndexError:
    -            # This exception is caught if there are no more artifacts in the list
    -            # LRP_artifacts. This means the artifact is too large for the filesystem
    -            # so we abort the process
    -            raise ArtifactTooLargeException("Artifact of size {} is too large for "
    -                                            "the filesystem which mounts the remote "
    -                                            "cache".format(object_size))
    +    def __init__(self, cas, max_head_size, min_head_size=int(2e9)):
    +        self.__cas = cas
    +        self.__max_head_size = max_head_size
    +        self.__min_head_size = min_head_size

    -        removed_size += cas.remove(to_remove, defer_prune=False)
    +    def __has_space(self, object_size):
    +        stats = os.statvfs(self.__cas.casdir)
    +        free_disk_space = (stats.f_bavail * stats.f_bsize) - self.__min_head_size
    +        total_disk_space = (stats.f_blocks * stats.f_bsize) - self.__min_head_size

    -    if removed_size > 0:
    -        logging.info("Successfully removed {} bytes from the cache".format(removed_size))
    -    else:
    -        logging.info("No artifacts were removed from the cache.")
    +        if object_size > total_disk_space:
    +            raise ArtifactTooLargeException("Artifact of size: {} is too large for "
    +                                            "the filesystem which mounts the remote "
    +                                            "cache".format(object_size))

    -    return removed_size
    +        return object_size <= free_disk_space
    +
    +    # clean_up()
    +    #
    +    # Keep removing Least Recently Pushed (LRP) artifacts in a cache until there
    +    # is enough space for the incoming artifact
    +    #
    +    # Args:
    +    #   object_size: The size of the object being received in bytes
    +    #
    +    # Returns:
    +    #   int: The total bytes removed on the filesystem
    +    #
    +    def clean_up(self, object_size):
    +        if self.__has_space(object_size):
    +            return 0
    +
    +        with _CacheCleaner.__cleanup_cache_lock:
    +            if self.__has_space(object_size):
    +                # Another thread has done the cleanup for us
    +                return 0
    +
    +            stats = os.statvfs(self.__cas.casdir)
    +            target_disk_space = (stats.f_bavail * stats.f_bsize) - self.__max_head_size
    +
    +            # obtain a list of LRP artifacts
    +            LRP_objects = self.__cas.list_objects()
    +
    +            removed_size = 0  # in bytes
    +
    +            last_mtime = 0
    +
    +            while object_size - removed_size > target_disk_space:
    +                try:
    +                    last_mtime, to_remove = LRP_objects.pop(0)  # The first element in the list is the LRP object
    +                except IndexError:
    +                    # This exception is caught if there are no more objects in the
    +                    # list LRP_objects, which means the artifact is too large for
    +                    # the filesystem, so we abort the process
    +                    raise ArtifactTooLargeException("Artifact of size {} is too large for "
    +                                                    "the filesystem which mounts the remote "
    +                                                    "cache".format(object_size))
    +
    +                try:
    +                    size = os.stat(to_remove).st_size
    +                    os.unlink(to_remove)
    +                    removed_size += size
    +                except FileNotFoundError:
    +                    pass
    +
    +            self.__cas.clean_up_refs_until(last_mtime)
    +
    +            if removed_size > 0:
    +                logging.info("Successfully removed {} bytes from the cache".format(removed_size))
    +            else:
    +                logging.info("No artifacts were removed from the cache.")
    +
    +            return removed_size
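
    The upload path above reserves the full blob size with fallocate(2) before
    writing, so ENOSPC surfaces early and the cleaner can run again. A minimal
    standalone sketch of the same ctypes call (Linux/glibc only; reserve_space()
    is an invented helper name):

        import ctypes
        import os
        import tempfile

        libc = ctypes.CDLL("libc.so.6", use_errno=True)

        def reserve_space(fd, length):
            # fallocate(fd, mode=0, offset=0, len=length) preallocates blocks;
            # it returns -1 and sets errno (e.g. ENOSPC) on failure.
            ret = libc.fallocate(ctypes.c_int(fd), ctypes.c_int(0),
                                 ctypes.c_int64(0), ctypes.c_int64(length))
            if ret == -1:
                err = ctypes.get_errno()
                raise OSError(err, os.strerror(err))

        with tempfile.TemporaryFile() as f:
            reserve_space(f.fileno(), 1024 * 1024)  # reserve 1 MiB up front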

  • buildstream/_platform/linux.py
    @@ -18,9 +18,9 @@
     #        Tristan Maat <tristan maat codethink co uk>

     import os
    -import shutil
     import subprocess

    +from .. import _site
     from .. import utils
     from ..sandbox import SandboxDummy

    @@ -38,16 +38,18 @@ class Linux(Platform):

             self._have_fuse = os.path.exists("/dev/fuse")

    -        bwrap_version = self._get_bwrap_version()
    +        bwrap_version = _site.get_bwrap_version()

             if bwrap_version is None:
                 self._bwrap_exists = False
                 self._have_good_bwrap = False
                 self._die_with_parent_available = False
    +            self._json_status_available = False
             else:
                 self._bwrap_exists = True
                 self._have_good_bwrap = (0, 1, 2) <= bwrap_version
                 self._die_with_parent_available = (0, 1, 8) <= bwrap_version
    +            self._json_status_available = (0, 3, 2) <= bwrap_version

             self._local_sandbox_available = self._have_fuse and self._have_good_bwrap

    @@ -97,6 +99,7 @@ class Linux(Platform):
             # Inform the bubblewrap sandbox as to whether it can use user namespaces or not
             kwargs['user_ns_available'] = self._user_ns_available
             kwargs['die_with_parent_available'] = self._die_with_parent_available
    +        kwargs['json_status_available'] = self._json_status_available
             return SandboxBwrap(*args, **kwargs)

         def _check_user_ns_available(self):
    @@ -119,21 +122,3 @@ class Linux(Platform):
                 output = ''

             return output == 'root'
    -
    -    def _get_bwrap_version(self):
    -        # Get the current bwrap version
    -        #
    -        # returns None if no bwrap was found
    -        # otherwise returns a tuple of 3 int: major, minor, patch
    -        bwrap_path = shutil.which('bwrap')
    -
    -        if not bwrap_path:
    -            return None
    -
    -        cmd = [bwrap_path, "--version"]
    -        try:
    -            version = str(subprocess.check_output(cmd).split()[1], "utf-8")
    -        except subprocess.CalledProcessError:
    -            return None
    -
    -        return tuple(int(x) for x in version.split("."))

  • buildstream/_project.py
    @@ -219,6 +219,19 @@ class Project():

             return self._cache_key

    +    def _validate_node(self, node):
    +        _yaml.node_validate(node, [
    +            'format-version',
    +            'element-path', 'variables',
    +            'environment', 'environment-nocache',
    +            'split-rules', 'elements', 'plugins',
    +            'aliases', 'name',
    +            'artifacts', 'options',
    +            'fail-on-overlap', 'shell', 'fatal-warnings',
    +            'ref-storage', 'sandbox', 'mirrors', 'remote-execution',
    +            'sources', '(@)'
    +        ])
    +
         # create_element()
         #
         # Instantiate and return an element
    @@ -402,6 +415,8 @@ class Project():
                     "Project requested format version {}, but BuildStream {}.{} only supports up until format version {}"
                     .format(format_version, major, minor, BST_FORMAT_VERSION))

    +        self._validate_node(pre_config_node)
    +
             # FIXME:
             #
             #   Performing this check manually in the absense
    @@ -467,16 +482,7 @@ class Project():

             self._load_pass(config, self.config)

    -        _yaml.node_validate(config, [
    -            'format-version',
    -            'element-path', 'variables',
    -            'environment', 'environment-nocache',
    -            'split-rules', 'elements', 'plugins',
    -            'aliases', 'name',
    -            'artifacts', 'options',
    -            'fail-on-overlap', 'shell', 'fatal-warnings',
    -            'ref-storage', 'sandbox', 'mirrors', 'remote-execution'
    -        ])
    +        self._validate_node(config)

             #
             # Now all YAML composition is done, from here on we just load

  • buildstream/_site.py
    @@ -18,6 +18,8 @@
     #        Tristan Van Berkom <tristan vanberkom codethink co uk>

     import os
    +import shutil
    +import subprocess

     #
     # Private module declaring some info about where the buildstream
    @@ -44,3 +46,22 @@ build_all_template = os.path.join(root, 'data', 'build-all.sh.in')

     # Module building script template
     build_module_template = os.path.join(root, 'data', 'build-module.sh.in')
    +
    +
    +def get_bwrap_version():
    +    # Get the current bwrap version
    +    #
    +    # returns None if no bwrap was found
    +    # otherwise returns a tuple of 3 ints: major, minor, patch
    +    bwrap_path = shutil.which('bwrap')
    +
    +    if not bwrap_path:
    +        return None
    +
    +    cmd = [bwrap_path, "--version"]
    +    try:
    +        version = str(subprocess.check_output(cmd).split()[1], "utf-8")
    +    except subprocess.CalledProcessError:
    +        return None
    +
    +    return tuple(int(x) for x in version.split("."))
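
    With version detection in _site, any module can gate features on the bwrap
    version using plain tuple comparison. A small usage sketch (assuming bwrap
    is installed):

        from buildstream import _site

        version = _site.get_bwrap_version()   # e.g. (0, 3, 2), or None if absent
        if version is not None:
            # Tuples compare element-wise, so this is an
            # "at least version 0.3.2" check.
            json_status_available = (0, 3, 2) <= version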

  • buildstream/sandbox/_sandboxbwrap.py
    @@ -17,6 +17,8 @@
     #  Authors:
     #        Andrew Leeming <andrew leeming codethink co uk>
     #        Tristan Van Berkom <tristan vanberkom codethink co uk>
    +import collections
    +import json
     import os
     import sys
     import time
    @@ -24,7 +26,8 @@ import errno
     import signal
     import subprocess
     import shutil
    -from contextlib import ExitStack
    +from contextlib import ExitStack, suppress
    +from tempfile import TemporaryFile

     import psutil

    @@ -53,6 +56,7 @@ class SandboxBwrap(Sandbox):
             super().__init__(*args, **kwargs)
             self.user_ns_available = kwargs['user_ns_available']
             self.die_with_parent_available = kwargs['die_with_parent_available']
    +        self.json_status_available = kwargs['json_status_available']

         def run(self, command, flags, *, cwd=None, env=None):
             stdout, stderr = self._get_output()
    @@ -160,24 +164,31 @@ class SandboxBwrap(Sandbox):
                     gid = self._get_config().build_gid
                     bwrap_command += ['--uid', str(uid), '--gid', str(gid)]

    -        # Add the command
    -        bwrap_command += command
    -
    -        # bwrap might create some directories while being suid
    -        # and may give them to root gid, if it does, we'll want
    -        # to clean them up after, so record what we already had
    -        # there just in case so that we can safely cleanup the debris.
    -        #
    -        existing_basedirs = {
    -            directory: os.path.exists(os.path.join(root_directory, directory))
    -            for directory in ['tmp', 'dev', 'proc']
    -        }
    -
    -        # Use the MountMap context manager to ensure that any redirected
    -        # mounts through fuse layers are in context and ready for bwrap
    -        # to mount them from.
    -        #
             with ExitStack() as stack:
    +            pass_fds = ()
    +            # Improve error reporting with json-status if available
    +            if self.json_status_available:
    +                json_status_file = stack.enter_context(TemporaryFile())
    +                pass_fds = (json_status_file.fileno(),)
    +                bwrap_command += ['--json-status-fd', str(json_status_file.fileno())]
    +
    +            # Add the command
    +            bwrap_command += command
    +
    +            # bwrap might create some directories while being suid
    +            # and may give them to root gid, if it does, we'll want
    +            # to clean them up after, so record what we already had
    +            # there just in case so that we can safely cleanup the debris.
    +            #
    +            existing_basedirs = {
    +                directory: os.path.exists(os.path.join(root_directory, directory))
    +                for directory in ['tmp', 'dev', 'proc']
    +            }
    +
    +            # Use the MountMap context manager to ensure that any redirected
    +            # mounts through fuse layers are in context and ready for bwrap
    +            # to mount them from.
    +            #
                 stack.enter_context(mount_map.mounted(self))

                 # If we're interactive, we want to inherit our stdin,
    @@ -190,7 +201,7 @@ class SandboxBwrap(Sandbox):

                 # Run bubblewrap !
                 exit_code = self.run_bwrap(bwrap_command, stdin, stdout, stderr,
    -                                       (flags & SandboxFlags.INTERACTIVE))
    +                                       (flags & SandboxFlags.INTERACTIVE), pass_fds)

                 # Cleanup things which bwrap might have left behind, while
                 # everything is still mounted because bwrap can be creating
    @@ -238,10 +249,27 @@ class SandboxBwrap(Sandbox):
                             # a bug, bwrap mounted a tempfs here and when it exits, that better be empty.
                             pass

    +            if self.json_status_available:
    +                json_status_file.seek(0, 0)
    +                child_exit_code = None
    +                # The JSON status file's output is a JSON object per line
    +                # with the keys present identifying the type of message.
    +                # The only message relevant to us now is the exit-code of the subprocess.
    +                for line in json_status_file:
    +                    with suppress(json.decoder.JSONDecodeError):
    +                        o = json.loads(line)
    +                        if isinstance(o, collections.abc.Mapping) and 'exit-code' in o:
    +                            child_exit_code = o['exit-code']
    +                            break
    +                if child_exit_code is None:
    +                    raise SandboxError("`bwrap' terminated during sandbox setup with exitcode {}".format(exit_code),
    +                                       reason="bwrap-sandbox-fail")
    +                exit_code = child_exit_code
    +
             self._vdir._mark_changed()
             return exit_code

    -    def run_bwrap(self, argv, stdin, stdout, stderr, interactive):
    +    def run_bwrap(self, argv, stdin, stdout, stderr, interactive, pass_fds):
             # Wrapper around subprocess.Popen() with common settings.
             #
             # This function blocks until the subprocess has terminated.
    @@ -317,6 +345,7 @@ class SandboxBwrap(Sandbox):
                     # The default is to share file descriptors from the parent process
                     # to the subprocess, which is rarely good for sandboxing.
                     close_fds=True,
    +                pass_fds=pass_fds,
                     stdin=stdin,
                     stdout=stdout,
                     stderr=stderr,

  • tests/cachekey/cachekey.py
    @@ -36,7 +36,7 @@
     # the result.
     #
     from tests.testutils.runcli import cli
    -from tests.testutils.site import HAVE_BZR, HAVE_GIT, HAVE_OSTREE, IS_LINUX
    +from tests.testutils.site import HAVE_BZR, HAVE_GIT, HAVE_OSTREE, IS_LINUX, MACHINE_ARCH
     from buildstream.plugin import CoreWarnings
     from buildstream import _yaml
     import os
    @@ -144,6 +144,8 @@ DATA_DIR = os.path.join(
     # The cache key test uses a project which exercises all plugins,
     # so we cant run it at all if we dont have them installed.
     #
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    +                    reason='Cache keys depend on architecture')
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
     @pytest.mark.skipif(HAVE_BZR is False, reason="bzr is not available")
     @pytest.mark.skipif(HAVE_GIT is False, reason="git is not available")

  • tests/examples/autotools.py
    @@ -3,7 +3,7 @@ import pytest

     from tests.testutils import cli_integration as cli
     from tests.testutils.integration import assert_contains
    -from tests.testutils.site import IS_LINUX
    +from tests.testutils.site import IS_LINUX, MACHINE_ARCH

     pytestmark = pytest.mark.integration

    @@ -13,6 +13,8 @@ DATA_DIR = os.path.join(


     # Tests a build of the autotools amhello project on a alpine-linux base runtime
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    +                    reason='Examples are written for x86_64')
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
     @pytest.mark.datafiles(DATA_DIR)
     def test_autotools_build(cli, tmpdir, datafiles):
    @@ -36,6 +38,8 @@ def test_autotools_build(cli, tmpdir, datafiles):


     # Test running an executable built with autotools.
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    +                    reason='Examples are written for x86_64')
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
     @pytest.mark.datafiles(DATA_DIR)
     def test_autotools_run(cli, tmpdir, datafiles):

  • tests/examples/developing.py
    ... ... @@ -4,7 +4,7 @@ import pytest
    4 4
     import tests.testutils.patch as patch
    
    5 5
     from tests.testutils import cli_integration as cli
    
    6 6
     from tests.testutils.integration import assert_contains
    
    7
    -from tests.testutils.site import IS_LINUX
    
    7
    +from tests.testutils.site import IS_LINUX, MACHINE_ARCH
    
    8 8
     
    
    9 9
     pytestmark = pytest.mark.integration
    
    10 10
     
    
    ... ... @@ -14,6 +14,8 @@ DATA_DIR = os.path.join(
    14 14
     
    
    15 15
     
    
    16 16
     # Test that the project builds successfully
    
    17
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    18
    +                    reason='Examples are writtent for x86_64')
    
    17 19
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    18 20
     @pytest.mark.datafiles(DATA_DIR)
    
    19 21
     def test_autotools_build(cli, tmpdir, datafiles):
    
    ... ... @@ -35,6 +37,8 @@ def test_autotools_build(cli, tmpdir, datafiles):
    35 37
     
    
    36 38
     
    
    37 39
     # Test the unmodified hello command works as expected.
    
    40
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    41
    +                    reason='Examples are written for x86_64')
    
    38 42
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    39 43
     @pytest.mark.datafiles(DATA_DIR)
    
    40 44
     def test_run_unmodified_hello(cli, tmpdir, datafiles):
    
    ... ... @@ -66,6 +70,8 @@ def test_open_workspace(cli, tmpdir, datafiles):
    66 70
     
    
    67 71
     
    
    68 72
     # Test making a change using the workspace
    
    73
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    74
    +                    reason='Examples are written for x86_64')
    
    69 75
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    70 76
     @pytest.mark.datafiles(DATA_DIR)
    
    71 77
     def test_make_change_in_workspace(cli, tmpdir, datafiles):
    

  • tests/examples/flatpak-autotools.py
    ... ... @@ -3,7 +3,7 @@ import pytest
    3 3
     
    
    4 4
     from tests.testutils import cli_integration as cli
    
    5 5
     from tests.testutils.integration import assert_contains
    
    6
    -from tests.testutils.site import IS_LINUX
    
    6
    +from tests.testutils.site import IS_LINUX, MACHINE_ARCH
    
    7 7
     
    
    8 8
     
    
    9 9
     pytestmark = pytest.mark.integration
    
    ... ... @@ -32,6 +32,8 @@ def workaround_setuptools_bug(project):
    32 32
     
    
    33 33
     # Test that a build upon flatpak runtime 'works' - we use the autotools sample
    
    34 34
     # amhello project for this.
    
    35
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    36
    +                    reason='Examples are written for x86_64')
    
    35 37
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    36 38
     @pytest.mark.datafiles(DATA_DIR)
    
    37 39
     def test_autotools_build(cli, tmpdir, datafiles):
    
    ... ... @@ -55,6 +57,8 @@ def test_autotools_build(cli, tmpdir, datafiles):
    55 57
     
    
    56 58
     
    
    57 59
     # Test running an executable built with autotools
    
    60
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    61
    +                    reason='Examples are writtent for x86_64')
    
    58 62
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    59 63
     @pytest.mark.datafiles(DATA_DIR)
    
    60 64
     def test_autotools_run(cli, tmpdir, datafiles):
    

  • tests/examples/integration-commands.py
    ... ... @@ -3,7 +3,7 @@ import pytest
    3 3
     
    
    4 4
     from tests.testutils import cli_integration as cli
    
    5 5
     from tests.testutils.integration import assert_contains
    
    6
    -from tests.testutils.site import IS_LINUX
    
    6
    +from tests.testutils.site import IS_LINUX, MACHINE_ARCH
    
    7 7
     
    
    8 8
     
    
    9 9
     pytestmark = pytest.mark.integration
    
    ... ... @@ -12,6 +12,8 @@ DATA_DIR = os.path.join(
    12 12
     )
    
    13 13
     
    
    14 14
     
    
    15
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    16
    +                    reason='Examples are written for x86_64')
    
    15 17
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    16 18
     @pytest.mark.datafiles(DATA_DIR)
    
    17 19
     def test_integration_commands_build(cli, tmpdir, datafiles):
    
    ... ... @@ -23,6 +25,8 @@ def test_integration_commands_build(cli, tmpdir, datafiles):
    23 25
     
    
    24 26
     
    
    25 27
     # Test running the executable
    
    28
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    29
    +                    reason='Examples are written for x86_64')
    
    26 30
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    27 31
     @pytest.mark.datafiles(DATA_DIR)
    
    28 32
     def test_integration_commands_run(cli, tmpdir, datafiles):
    

  • tests/examples/junctions.py
    ... ... @@ -3,7 +3,7 @@ import pytest
    3 3
     
    
    4 4
     from tests.testutils import cli_integration as cli
    
    5 5
     from tests.testutils.integration import assert_contains
    
    6
    -from tests.testutils.site import IS_LINUX
    
    6
    +from tests.testutils.site import IS_LINUX, MACHINE_ARCH
    
    7 7
     
    
    8 8
     pytestmark = pytest.mark.integration
    
    9 9
     
    
    ... ... @@ -13,6 +13,8 @@ DATA_DIR = os.path.join(
    13 13
     
    
    14 14
     
    
    15 15
     # Test that the project builds successfully
    
    16
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    17
    +                    reason='Examples are written for x86_64')
    
    16 18
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    17 19
     @pytest.mark.datafiles(DATA_DIR)
    
    18 20
     def test_build(cli, tmpdir, datafiles):
    
    ... ... @@ -23,6 +25,8 @@ def test_build(cli, tmpdir, datafiles):
    23 25
     
    
    24 26
     
    
    25 27
 # Test that the callHello script works as expected.
    
    28
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    29
    +                    reason='Examples are written for x86_64')
    
    26 30
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    27 31
     @pytest.mark.datafiles(DATA_DIR)
    
    28 32
     def test_shell_call_hello(cli, tmpdir, datafiles):
    

  • tests/examples/running-commands.py
    ... ... @@ -3,7 +3,7 @@ import pytest
    3 3
     
    
    4 4
     from tests.testutils import cli_integration as cli
    
    5 5
     from tests.testutils.integration import assert_contains
    
    6
    -from tests.testutils.site import IS_LINUX
    
    6
    +from tests.testutils.site import IS_LINUX, MACHINE_ARCH
    
    7 7
     
    
    8 8
     
    
    9 9
     pytestmark = pytest.mark.integration
    
    ... ... @@ -12,6 +12,8 @@ DATA_DIR = os.path.join(
    12 12
     )
    
    13 13
     
    
    14 14
     
    
    15
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    16
    +                    reason='Examples are written for x86_64')
    
    15 17
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    16 18
     @pytest.mark.datafiles(DATA_DIR)
    
    17 19
     def test_running_commands_build(cli, tmpdir, datafiles):
    
    ... ... @@ -23,6 +25,8 @@ def test_running_commands_build(cli, tmpdir, datafiles):
    23 25
     
    
    24 26
     
    
    25 27
     # Test running the executable
    
    28
    +@pytest.mark.skipif(MACHINE_ARCH != 'x86_64',
    
    29
    +                    reason='Examples are written for x86_64')
    
    26 30
     @pytest.mark.skipif(not IS_LINUX, reason='Only available on linux')
    
    27 31
     @pytest.mark.datafiles(DATA_DIR)
    
    28 32
     def test_running_commands_run(cli, tmpdir, datafiles):
    

  • tests/format/list-directive-type-error/project.conf
    ... ... @@ -4,4 +4,4 @@ options:
    4 4
       arch:
    
    5 5
         type: arch
    
    6 6
         description: Example architecture option
    
    7
    -    values: [ x86_32, x86_64 ]
    7
    +    values: [ x86_32, x86_64, aarch64 ]

  • tests/frontend/invalid_element_path/project.conf
    1
    +# Project config for frontend build test
    
    2
    +name: test
    
    3
    +
    
    4
    +elephant-path: elements

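This fixture deliberately misspells the 'element-path' key as 'elephant-path', so that the new test in tests/frontend/show.py below can assert that loading fails with LoadErrorReason.INVALID_DATA. A rough sketch (not BuildStream's actual validation code) of the kind of key check that rejects it:

    # Hypothetical, abridged key set for illustration only.
    KNOWN_KEYS = {'name', 'element-path'}

    def validate_project_conf(conf):
        unknown = set(conf) - KNOWN_KEYS
        if unknown:
            raise ValueError('Unknown keys in project.conf: {}'.format(unknown))

    validate_project_conf({'name': 'test', 'elephant-path': 'elements'})  # raises
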
  • tests/frontend/push.py
    ... ... @@ -230,6 +230,8 @@ def test_artifact_expires(cli, datafiles, tmpdir):
    230 230
         # Create an artifact share (remote artifact cache) in the tmpdir/artifactshare
    
    231 231
         # Mock a file system with 12 MB free disk space
    
    232 232
         with create_artifact_share(os.path.join(str(tmpdir), 'artifactshare'),
    
    233
    +                               min_head_size=int(2e9),
    
    234
    +                               max_head_size=int(2e9),
    
    233 235
                                    total_space=int(10e9), free_space=(int(12e6) + int(2e9))) as share:
    
    234 236
     
    
    235 237
             # Configure bst to push to the cache
    
    ... ... @@ -313,6 +315,8 @@ def test_recently_pulled_artifact_does_not_expire(cli, datafiles, tmpdir):
    313 315
         # Create an artifact share (remote cache) in tmpdir/artifactshare
    
    314 316
         # Mock a file system with 12 MB free disk space
    
    315 317
         with create_artifact_share(os.path.join(str(tmpdir), 'artifactshare'),
    
    318
    +                               min_head_size=int(2e9),
    
    319
    +                               max_head_size=int(2e9),
    
    316 320
                                    total_space=int(10e9), free_space=(int(12e6) + int(2e9))) as share:
    
    317 321
     
    
    318 322
             # Configure bst to push to the cache
    

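In both tests the head size bounds are pinned to the same 2 GB that is added on top of free_space, so (on my reading of the test setup) the mocked share is left with only 12 MB of genuinely spare room before expiry must kick in. The arithmetic, using the values from the calls above:

    total_space = int(10e9)             # mocked filesystem size: 10 GB
    head_size = int(2e9)                # min_head_size == max_head_size == 2 GB
    free_space = int(12e6) + head_size  # as passed to create_artifact_share()

    usable = free_space - head_size
    print(usable)  # 12000000, i.e. ~12 MB before artifacts must expire
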
  • tests/frontend/show.py
    ... ... @@ -36,6 +36,19 @@ def test_show(cli, datafiles, target, format, expected):
    36 36
                                  .format(expected, result.output))
    
    37 37
     
    
    38 38
     
    
    39
    +@pytest.mark.datafiles(os.path.join(
    
    40
    +    os.path.dirname(os.path.realpath(__file__)),
    
    41
    +    "invalid_element_path",
    
    42
    +))
    
    43
    +def test_show_invalid_element_path(cli, datafiles):
    
    44
    +    project = os.path.join(datafiles.dirname, datafiles.basename)
    
    45
    +    result = cli.run(project=project, silent=True, args=[
    
    46
    +        'show',
    
    47
    +        "foo.bst"])
    
    48
    +
    
    49
    +    result.assert_main_error(ErrorDomain.LOAD, LoadErrorReason.INVALID_DATA)
    
    50
    +
    
    51
    +
    
    39 52
     @pytest.mark.datafiles(DATA_DIR)
    
    40 53
     @pytest.mark.parametrize("target,except_,expected", [
    
    41 54
         ('target.bst', 'import-bin.bst', ['import-dev.bst', 'compose-all.bst', 'target.bst']),
    

  • tests/integration/project/elements/base/base-alpine.bst
    ... ... @@ -7,6 +7,11 @@ description: |
    7 7
     
    
    8 8
     sources:
    
    9 9
       - kind: tar
    
    10
    -    url: alpine:integration-tests-base.v1.x86_64.tar.xz
    
    11 10
         base-dir: ''
    
    12
    -    ref: 3eb559250ba82b64a68d86d0636a6b127aa5f6d25d3601a79f79214dc9703639
    11
    +    (?):
    
    12
    +    - arch == "x86_64":
    
    13
    +        ref: 3eb559250ba82b64a68d86d0636a6b127aa5f6d25d3601a79f79214dc9703639
    
    14
    +        url: "alpine:integration-tests-base.v1.x86_64.tar.xz"
    
    15
    +    - arch == "aarch64":
    
    16
    +        ref: 431fb5362032ede6f172e70a3258354a8fd71fcbdeb1edebc0e20968c792329a
    
    17
    +        url: "alpine:integration-tests-base.v1.aarch64.tar.xz"

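The single hard-coded x86_64 tarball becomes a (?) conditional keyed on the new 'arch' project option, selecting a ref and url per architecture. A sketch in Python of the mapping the directive expresses (not how BuildStream resolves it):

    import os

    SOURCE_BY_ARCH = {
        'x86_64': ('alpine:integration-tests-base.v1.x86_64.tar.xz',
                   '3eb559250ba82b64a68d86d0636a6b127aa5f6d25d3601a79f79214dc9703639'),
        'aarch64': ('alpine:integration-tests-base.v1.aarch64.tar.xz',
                    '431fb5362032ede6f172e70a3258354a8fd71fcbdeb1edebc0e20968c792329a'),
    }

    url, ref = SOURCE_BY_ARCH[os.uname().machine]
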
  • tests/integration/project/elements/sandbox-bwrap/break-shell.bst
    1
    +kind: manual
    
    2
    +depends:
    
    3
    +  - base/base-alpine.bst
    
    4
    +
    
    5
    +public:
    
    6
    +  bst:
    
    7
    +    integration-commands:
    
    8
    +    - |
    
    9
    +      chmod a-x /bin/sh

  • tests/integration/project/elements/sandbox-bwrap/command-exit-42.bst
    1
    +kind: manual
    
    2
    +depends:
    
    3
    +  - base/base-alpine.bst
    
    4
    +
    
    5
    +config:
    
    6
    +  build-commands:
    
    7
    +  - |
    
    8
    +    exit 42

  • tests/integration/project/elements/sandbox-bwrap/non-executable-shell.bst
    1
    +kind: manual
    
    2
    +
    
    3
    +depends:
    
    4
    +  - sandbox-bwrap/break-shell.bst
    
    5
    +
    
    6
    +config:
    
    7
    +  build-commands:
    
    8
    +  - |
    
    9
    +    exit 42

  • tests/integration/project/project.conf
    ... ... @@ -9,6 +9,12 @@ options:
    9 9
         type: bool
    
    10 10
         description: Whether to expect a linux platform
    
    11 11
         default: True
    
    12
    +  arch:
    
    13
    +    type: arch
    
    14
    +    description: Current architecture
    
    15
    +    values:
    
    16
    +      - x86_64
    
    17
    +      - aarch64
    
    12 18
     split-rules:
    
    13 19
       test:
    
    14 20
         - |
    

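The integration project now declares an 'arch' option of type 'arch', which (as I understand the option type) resolves from the host's machine name and must match one of the declared values; this is what the (?) conditional in base-alpine.bst selects on. A small sketch of that resolution:

    import os

    ARCH_VALUES = ['x86_64', 'aarch64']   # as declared in project.conf above

    machine = os.uname().machine
    if machine not in ARCH_VALUES:
        raise ValueError('unsupported architecture: {}'.format(machine))
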
  • tests/integration/sandbox-bwrap.py
    1 1
     import os
    
    2 2
     import pytest
    
    3 3
     
    
    4
    +from buildstream._exceptions import ErrorDomain
    
    5
    +
    
    4 6
     from tests.testutils import cli_integration as cli
    
    5 7
     from tests.testutils.integration import assert_contains
    
    6
    -from tests.testutils.site import HAVE_BWRAP
    
    8
    +from tests.testutils.site import HAVE_BWRAP, HAVE_BWRAP_JSON_STATUS
    
    7 9
     
    
    8 10
     
    
    9 11
     pytestmark = pytest.mark.integration
    
    ... ... @@ -29,3 +31,32 @@ def test_sandbox_bwrap_cleanup_build(cli, tmpdir, datafiles):
    29 31
         # Here, BuildStream should not attempt any rmdir etc.
    
    30 32
         result = cli.run(project=project, args=['build', element_name])
    
    31 33
         assert result.exit_code == 0
    
    34
    +
    
    35
    +
    
    36
    +@pytest.mark.skipif(not HAVE_BWRAP, reason='Only available with bubblewrap')
    
    37
    +@pytest.mark.skipif(not HAVE_BWRAP_JSON_STATUS, reason='Only available with bubblewrap supporting --json-status-fd')
    
    38
    +@pytest.mark.datafiles(DATA_DIR)
    
    39
    +def test_sandbox_bwrap_distinguish_setup_error(cli, tmpdir, datafiles):
    
    40
    +    project = os.path.join(datafiles.dirname, datafiles.basename)
    
    41
    +    element_name = 'sandbox-bwrap/non-executable-shell.bst'
    
    42
    +
    
    43
    +    result = cli.run(project=project, args=['build', element_name])
    
    44
    +    result.assert_task_error(error_domain=ErrorDomain.SANDBOX, error_reason="bwrap-sandbox-fail")
    
    45
    +
    
    46
    +
    
    47
    +@pytest.mark.integration
    
    48
    +@pytest.mark.skipif(not HAVE_BWRAP, reason='Only available with bubblewrap')
    
    49
    +@pytest.mark.datafiles(DATA_DIR)
    
    50
    +def test_sandbox_bwrap_return_subprocess(cli, tmpdir, datafiles):
    
    51
    +    project = os.path.join(datafiles.dirname, datafiles.basename)
    
    52
    +    element_name = 'sandbox-bwrap/command-exit-42.bst'
    
    53
    +
    
    54
    +    cli.configure({
    
    55
    +        "logging": {
    
    56
    +            "message-format": "%{element}|%{message}",
    
    57
    +        },
    
    58
    +    })
    
    59
    +
    
    60
    +    result = cli.run(project=project, args=['build', element_name])
    
    61
    +    result.assert_task_error(error_domain=ErrorDomain.ELEMENT, error_reason=None)
    
    62
    +    assert "sandbox-bwrap/command-exit-42.bst|Command 'exit 42' failed with exitcode 42" in result.stderr

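The two new tests rely on bubblewrap's --json-status-fd (available from 0.3.2, per the HAVE_BWRAP_JSON_STATUS check in tests/testutils/site.py below) to tell a sandbox setup failure apart from a command that merely exits non-zero. A hedged sketch of reading that status stream, assuming bwrap is installed on the host:

    import json
    import os
    import subprocess

    # Run a failing command under bwrap and collect its status documents.
    r, w = os.pipe()
    proc = subprocess.Popen(
        ['bwrap', '--ro-bind', '/', '/', '--json-status-fd', str(w),
         'sh', '-c', 'exit 42'],
        pass_fds=(w,))
    os.close(w)
    proc.wait()
    with os.fdopen(r) as f:
        reports = [json.loads(line) for line in f if line.strip()]

    # As I understand the output, an "exit-code" document appears only when
    # the command actually ran; if bwrap fails during setup (for example a
    # non-executable shell), it never does, which is the distinction the
    # bwrap-sandbox-fail error reason captures.
    print(reports)
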
  • tests/testutils/artifactshare.py
    ... ... @@ -29,7 +29,11 @@ from buildstream._protos.build.bazel.remote.execution.v2 import remote_execution
    29 29
     #
    
    30 30
     class ArtifactShare():
    
    31 31
     
    
    32
    -    def __init__(self, directory, *, total_space=None, free_space=None):
    
    32
    +    def __init__(self, directory, *,
    
    33
    +                 total_space=None,
    
    34
    +                 free_space=None,
    
    35
    +                 min_head_size=int(2e9),
    
    36
    +                 max_head_size=int(10e9)):
    
    33 37
     
    
    34 38
             # The working directory for the artifact share (in case it
    
    35 39
             # needs to do something outside of its backend's storage folder).
    
    ... ... @@ -53,6 +57,9 @@ class ArtifactShare():
    53 57
             self.total_space = total_space
    
    54 58
             self.free_space = free_space
    
    55 59
     
    
    60
    +        self.max_head_size = max_head_size
    
    61
    +        self.min_head_size = min_head_size
    
    62
    +
    
    56 63
             q = Queue()
    
    57 64
     
    
    58 65
             self.process = Process(target=self.run, args=(q,))
    
    ... ... @@ -76,7 +83,10 @@ class ArtifactShare():
    76 83
                     self.free_space = self.total_space
    
    77 84
                 os.statvfs = self._mock_statvfs
    
    78 85
     
    
    79
    -        server = create_server(self.repodir, enable_push=True)
    
    86
    +        server = create_server(self.repodir,
    
    87
    +                               max_head_size=self.max_head_size,
    
    88
    +                               min_head_size=self.min_head_size,
    
    89
    +                               enable_push=True)
    
    80 90
             port = server.add_insecure_port('localhost:0')
    
    81 91
     
    
    82 92
             server.start()
    
    ... ... @@ -134,6 +144,15 @@ class ArtifactShare():
    134 144
     
    
    135 145
             try:
    
    136 146
                 tree = self.cas.resolve_ref(artifact_key)
    
    147
    +            reachable = set()
    
    148
    +            try:
    
    149
    +                self.cas._reachable_refs_dir(reachable, tree, update_mtime=False)
    
    150
    +            except FileNotFoundError:
    
    151
    +                return False
    
    152
    +            for digest in reachable:
    
    153
    +                object_name = os.path.join(self.cas.casdir, 'objects', digest[:2], digest[2:])
    
    154
    +                if not os.path.exists(object_name):
    
    155
    +                    return False
    
    137 156
                 return True
    
    138 157
             except CASError:
    
    139 158
                 return False
    
    ... ... @@ -165,8 +184,11 @@ class ArtifactShare():
    165 184
     # Create an ArtifactShare for use in a test case
    
    166 185
     #
    
    167 186
     @contextmanager
    
    168
    -def create_artifact_share(directory, *, total_space=None, free_space=None):
    
    169
    -    share = ArtifactShare(directory, total_space=total_space, free_space=free_space)
    
    187
    +def create_artifact_share(directory, *, total_space=None, free_space=None,
    
    188
    +                          min_head_size=int(2e9),
    
    189
    +                          max_head_size=int(10e9)):
    
    190
    +    share = ArtifactShare(directory, total_space=total_space, free_space=free_space,
    
    191
    +                          min_head_size=min_head_size, max_head_size=max_head_size)
    
    170 192
         try:
    
    171 193
             yield share
    
    172 194
         finally:
    

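has_artifact() is now stricter: resolving the ref is not enough, every blob reachable from its tree must also still be present on disk. The object path construction above splits the digest after two hex characters; a sketch, assuming SHA-256 digests as used by the Remote Execution API:

    import hashlib
    import os

    def object_path(casdir, digest_hex):
        # CAS objects live at objects/<first two chars>/<rest of digest>.
        return os.path.join(casdir, 'objects', digest_hex[:2], digest_hex[2:])

    digest = hashlib.sha256(b'example blob').hexdigest()
    print(object_path('/path/to/cas', digest))
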
  • tests/testutils/site.py
    ... ... @@ -4,7 +4,7 @@
    4 4
     import os
    
    5 5
     import sys
    
    6 6
     
    
    7
    -from buildstream import utils, ProgramNotFoundError
    
    7
    +from buildstream import _site, utils, ProgramNotFoundError
    
    8 8
     
    
    9 9
     try:
    
    10 10
         utils.get_host_tool('bzr')
    
    ... ... @@ -33,8 +33,10 @@ except (ImportError, ValueError):
    33 33
     try:
    
    34 34
         utils.get_host_tool('bwrap')
    
    35 35
         HAVE_BWRAP = True
    
    36
    +    HAVE_BWRAP_JSON_STATUS = _site.get_bwrap_version() >= (0, 3, 2)
    
    36 37
     except ProgramNotFoundError:
    
    37 38
         HAVE_BWRAP = False
    
    39
    +    HAVE_BWRAP_JSON_STATUS = False
    
    38 40
     
    
    39 41
     try:
    
    40 42
         utils.get_host_tool('lzip')
    
    ... ... @@ -49,3 +51,5 @@ except ImportError:
    49 51
         HAVE_ARPY = False
    
    50 52
     
    
    51 53
     IS_LINUX = os.getenv('BST_FORCE_BACKEND', sys.platform).startswith('linux')
    
    54
    +
    
    55
    +_, _, _, _, MACHINE_ARCH = os.uname()

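For reference, os.uname() returns a five-field result (sysname, nodename, release, version, machine), so the unpacking above binds MACHINE_ARCH to the hardware name, e.g. 'x86_64' or 'aarch64'; the named attribute form is equivalent:

    import os

    MACHINE_ARCH = os.uname().machine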

