Re: [BuildStream] Proposal: A small number of subprocesses handling jobs



On 2019-03-04 10:01, Tristan Van Berkom via buildstream-list wrote:
On Mon, 2019-03-04 at 10:40 +0100, Jürg Billeter wrote:
On Mon, 2019-03-04 at 18:07 +0900, Tristan Van Berkom wrote:
> On Mon, 2019-03-04 at 18:02 +0900, Tristan Van Berkom via
> buildstream-list wrote:
> > Hi,
>
> [...]
> > I would personally rather be reluctant about imposing this explicit
> > parallelism knowledge on the plugin API, and seek other
> > justifications (performance ?) before making plugins aware of what is
> > processed where.

Command batching already breaks that serial plugin API, though. My
proposal would simply take this further to cover staging.

One possibly significant difference is that plugins don't have to use
command batching, while my proposal would likely no longer allow
plugins to ignore it.

> So here is another idea...
>
> What if we elaborated and generalized further on state synchronization,
> such that:
>
> * We had a dedicated process pool
> * Plugins are run in the process pool
> * The Element/Source API used in a worker pool would obtain state from
>   the main process on demand, such that we load state lazily on demand
>   by querying the main process.
> * The core's responsibility then is to:
>   * Dispatch jobs in worker threads
>   * Respond to generic get/set requests to the data model in the
>     main process
>   * Update the UI periodically
>
> This approach might incur a lot more context switching between
> processes where plugins load state from the main process, but python
> processing remains parallelized and the whole implementation is
> transparent to the plugin.

On initial thoughts, this doesn't sound very appealing to me with
regards to implementation complexity and maintainability. Or can you
think of a way to implement this without significant extra complexity
in the core?

Firstly, it's not that I want to avoid significant extra complexity in
the core; I want to avoid *any* complexity in a plugin.

I personally attribute a huge amount of value to how simple the plugin
API is, and am happy to trade 1kloc in the core, under our control, in
order to avoid 1loc in the plugins which users should be able to easily
write themselves.

As an implementation, I would imagine perhaps we have a separate
subprocess to manage actual state and have both the main process and
worker processes be clients to that state-serving process.

The core Element and Source APIs would pass through separate
ElementProxy and SourceProxy APIs for storing and loading state.
This would probably amount to more boilerplate code than real
complexity; to fine-tune things for performance we would probably end
up making compromises with lazy loading.
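As a rough sketch of the proxy pass-through described above (a single-process simulation; `GraphService`, `ElementProxy` and the request tuples are hypothetical names, not BuildStream code — in the real design the service would live in its own subprocess and `send` would be an IPC round trip):

```python
class GraphService:
    """Owns the authoritative element state; answers get/set requests."""

    def __init__(self):
        self._state = {}  # element name -> {field: value}

    def handle(self, request):
        op, name, *rest = request
        if op == "get":
            (field,) = rest
            return self._state.get(name, {}).get(field)
        if op == "set":
            field, value = rest
            self._state.setdefault(name, {})[field] = value
            return None
        raise ValueError("unknown op: {!r}".format(op))


class ElementProxy:
    """Boilerplate shim: turns state access into service requests."""

    def __init__(self, name, send):
        self._name = name
        self._send = send  # stand-in for the IPC channel

    def get(self, field):
        return self._send(("get", self._name, field))

    def set(self, field, value):
        self._send(("set", self._name, field, value))


service = GraphService()
proxy = ElementProxy("base.bst", service.handle)
proxy.set("cached", True)
```

Most of the proxy is mechanical pass-through, which is the "boilerplate rather than complexity" trade-off: the protocol stays trivial, and the cost shows up as one round trip per state access.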

It might not be the best approach; the best approach might very well be
the current fork()-on-demand model we already have (unless forking
really *is* costing us too much time, which I think still needs to be
proven).

Cheers,
    -Tristan


Hi,

Today I had a discussion with Daniel, James, Chandan, Gokcen, Angelos and Chiara about the subprocess model, and our conclusions were broadly the same.

We discussed the possibility of a thread-based model, but didn't take it particularly seriously or explore implementation in detail because of the effect the GIL would have on virtual filesystem behaviour.

We discussed the pool of subprocesses, and came up with broadly two different ways to go about it:

1. A multiprocessing pool that forks off at the start of the scheduler.
===

* Changes to the element over the lifetime of a job will be captured, and passed through the job's result object when the job finishes.
* Changes from a job result will be received by the scheduler and pushed to each worker subprocess.
* Mandate that only the element that the job is running for can be changed in the job. A "soft" mandate (changes will not be propagated to the other workers) is enough for normal operation, but a separate mode where such changes are forbidden (or any changes outside the element are thrown away) would be useful for debugging.
* A "pristine" subprocess that is unchanged by jobs would be useful for forking new subprocesses, especially if we decide that each worker should have a finite lifetime.
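The capture-and-replay flow above could look roughly like this (a minimal single-process sketch; `StateDelta`, `TrackedElement` and `run_job` are illustrative names, not BuildStream API — in the real design `run_job` would execute in a forked worker and the delta would travel back in the job result):

```python
class StateDelta:
    """Records the field changes made to one element during a job."""

    def __init__(self, element_name):
        self.element_name = element_name
        self.changes = {}


class TrackedElement:
    """Element whose state mutations are captured as a delta."""

    def __init__(self, name, state=None):
        self.name = name
        self.state = dict(state or {})

    def run_job(self, job_fn):
        before = dict(self.state)
        job_fn(self)  # the job may only touch this element
        delta = StateDelta(self.name)
        for key, value in self.state.items():
            if before.get(key) != value:
                delta.changes[key] = value
        return delta  # travels back through the job's result object

    def apply(self, delta):
        # Replayed by the scheduler, then pushed to each worker's copy.
        assert delta.element_name == self.name
        self.state.update(delta.changes)


# The scheduler's copy and a worker's copy of the same element:
scheduler_copy = TrackedElement("base.bst", {"cached": False})
worker_copy = TrackedElement("base.bst", {"cached": False})

delta = worker_copy.run_job(lambda e: e.state.update(cached=True, key="abc"))
scheduler_copy.apply(delta)  # scheduler receives the job result
```

The "soft" mandate falls out naturally here: anything a job changes outside its own element simply never makes it into a delta, so it is silently dropped rather than propagated.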

2. An element graph service
===

i.e. one subprocess holds the pipeline and uses some form of IPC to get and set state changes.

This is a valuable long-term goal to work towards once we have a much better idea of where/when we access the element graph, and have completely encapsulated every place a plugin would access it. At that point, the graph-serving process will be acutely affected by any slowness in `_update_state()`, and if plugin authors can affect this then we will have to hope/impress/demand that it has a small time impact. (As an aside, this is currently not the case: git-based plugins implement `validate_cache()` and fork off a git subprocess to find the branch and tag. Using libgit2 here would be valuable.)
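One possible mitigation for that per-request sensitivity is a worker-side read cache in the proxy, assuming only the job that owns an element writes its state (a hypothetical sketch; `CachingProxy` and `send` are illustrative names, with `send` standing in for the IPC round trip to the element graph service):

```python
class CachingProxy:
    """Proxy to the graph service that memoizes reads per element."""

    def __init__(self, name, send):
        self._name = name
        self._send = send
        self._cache = {}
        self.round_trips = 0  # instrumentation for the example

    def get(self, field):
        if field not in self._cache:
            self.round_trips += 1
            self._cache[field] = self._send(("get", self._name, field))
        return self._cache[field]

    def set(self, field, value):
        self.round_trips += 1
        self._send(("set", self._name, field, value))
        self._cache[field] = value  # keep the cache coherent with our write


# Fake service endpoint so the sketch is self-contained:
backing = {"cached": False}

def fake_send(request):
    op, _name, *rest = request
    if op == "get":
        return backing.get(rest[0])
    backing[rest[0]] = rest[1]

proxy = CachingProxy("base.bst", fake_send)
first = proxy.get("cached")   # one round trip to the service
second = proxy.get("cached")  # served from the worker-side cache
```

This only stays correct under the single-writer assumption; if other processes can mutate an element's state mid-job, the service would additionally need to invalidate worker caches.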

===

Overall, our next actions in this regard will be:

1. Continue studying/simplifying/splitting up `_update_state()`, as the more we understand the ways that the element's state is altered, the better.
2. Work out all the places where elements read/write to the element graph, so we can identify which parts of the API can be extended with state tracking (for returning all state changes in a build result, or calling an element graph service), and consider adding new methods for the places where the graph is affected directly.

Best regards,

Jonathan
--
Jonathan Maw, Software Engineer, Codethink Ltd.
Codethink privacy policy: https://www.codethink.co.uk/privacy.html
