Optionally call main() in WASI reactors as a convenience. #133

PiotrSikora · 2021-03-01T08:27:39Z

WASI reactors differ from WASI commands in that they have multiple
entrypoints (i.e. proxy_on_* callbacks) instead of only main().

Currently, each Proxy-Wasm SDK uses different approch to startup:

AssemblyScript SDK uses Wasm's start function.
C++ SDK creates WASI reactor with global C++ constructors taking
care of early initialization and registration of plugins.
Rust SDK creates Wasm library, and suggests (via examples) using
_start() function called at startup to do early initialization.
Unfortunately, this is the same function name that WASI commands
are using, which means that WASI constructors cannot be injected
and executed at startup.
TinyGo SDK creates WASI command and calls main() at startup, but
it doesn't exit after main() function returns.

Calling main() in WASI reactors would allow us to prepare for when
they are stablized in Rust, and to have a non-breaking fallback in
case TinyGo decides to exit after main() function returns.

Signed-off-by: Piotr Sikora [email protected]

WASI reactors differ from WASI commands in that they have multiple entrypoints (i.e. proxy_on_* callbacks) instead of only main(). Currently, each Proxy-Wasm SDK uses different approch to startup: - AssemblyScript SDK uses Wasm's start function. - C++ SDK creates WASI reactor with global C++ constructors taking care of early initialization and registration of plugins. - Rust SDK creates Wasm library, and suggests (via examples) using _start() function called at startup to do early initialization. Unfortunately, this is the same function name that WASI commands are using, which means that WASI constructors cannot be injected and executed at startup. - TinyGo SDK creates WASI command and calls main() at startup, but it doesn't exit after main() function returns. Calling main() in WASI reactors would allow us to prepare for when they are stablized in Rust, and to have a non-breaking fallback in case TinyGo decides to exit after main() function returns. Signed-off-by: Piotr Sikora <[email protected]>

PiotrSikora · 2021-03-01T08:31:01Z

We didn't guard _start() or _initialize() previously, but perhaps this should be guarded on the ABI v0.2.2 that @Shikugawa needs for cluster name in gRPC calls?

PiotrSikora · 2021-03-01T08:57:32Z

We didn't guard _start() or _initialize() previously, but perhaps this should be guarded on the ABI v0.2.2 that @Shikugawa needs for cluster name in gRPC calls?

On the other hand, we already call main() in TinyGo SDK, since it's a WASI command, so perhaps this isn't a change after all...

mathetake · 2021-03-02T05:42:43Z

Looks good to me because this seems harmless to add. But as for TinyGo, this won't be a change since I wouldn't export main function literally in TinyGo's WASI target even when we start calling proc_exit at the end of _start. Instead, maybe another name would be adopted something like __tinygo_init or _initialize since calling user defined main alone is not enough because we have to call func init() {} functions in all packages used in programs. I'm the author of WASI target so hopefully I could have a bit of control over the decision.

mathetake · 2021-03-02T05:55:57Z

Ah - I was confused. Maybe we would export _initialize and main in TinyGo since the separation on this line (https://github.com/tinygo-org/tinygo/blob/release/src/runtime/scheduler_none.go#L22) to have _initialize and main makes sense to me, even though this is something to do with TinyGo's core so this may not be accepted by the community.

PiotrSikora · 2021-03-02T07:33:24Z

Looks good to me because this seems harmless to add. But as for TinyGo, this won't be a change since I wouldn't export main function literally in TinyGo's WASI target even when we start calling proc_exit at the end of _start. Instead, maybe another name would be adopted something like __tinygo_init or _initialize since calling user defined main alone is not enough because we have to call func init() {} functions in all packages used in programs. I'm the author of WASI target so hopefully I could have a bit of control over the decision.

Ah - I was confused. Maybe we would export _initialize and main in TinyGo since the separation on this line (https://github.com/tinygo-org/tinygo/blob/release/src/runtime/scheduler_none.go#L22) to have _initialize and main makes sense to me, even though this is something to do with TinyGo's core so this may not be accepted by the community.

TinyGo generates WASI command, which looks like this (pseudocode):

fn _start() {
  init_heap();
  global_constructors();
  main();
}

But in theory WASI command should look something like:

fn _start() {
  init_heap();
  global_constructors();
  retrun_value = main();
  global_destructors();
  proc_exit(return_value);
}

i.e. WASI command should exit after main() completes.

If TinyGo added support for WASI reactors (it should!), then it would look something like this:

fn _initialize() {
  init_heap();
  global_constructors();
}

So, my comment regarding main() and TinyGo was that if TinyGo ever added support for WASI reactors, then it would stop automatically calling main(), since WASI reactors don't do that, and then from the Proxy-Wasm plugin developer point of view, existing TinyGo plugins would stop working since main() would be no longer called.

Does it make sense?

mathetake · 2021-03-02T07:40:24Z

The only thing unclear to me is that calling proc_exit looks not specified in WASI spec, even though we should do that according to the tradition. Do you have any pointer to this?

But yeah, your comment finally totally makes sense to me, and I will work on reactor support in TinyGo in order to make TinyGo more WASI spec compatible. Thanks for the clarification!

mathetake · 2021-03-02T08:03:42Z

Ah - one more thing: maybe we would call main in WASI reactors' _initialize without exiting afterwards. That means

WASI commands would look like

fn _start() {
  init_heap();
  global_constructors();
  retrun_value = main();
  global_destructors();
  proc_exit(return_value);
}

WASI reactors would be like

fn _initialize() {
  init_heap();
  global_constructors();
  main(); // not exit afterwards
}

and then this PR would have no affect on TinyGo. This is kind of a design decision in TinyGo so I am not sure how this would look like when the WASI reactor support lands.

PiotrSikora · 2021-03-02T08:33:03Z

The only thing unclear to me is that calling proc_exit looks not specified in WASI spec, even though we should do that according to the tradition. Do you have any pointer to this?

I don't think it's defined anywhere, WASI spec might have even more tribal knowledge than Proxy-Wasm spec :)

But if you look at Emscripten implementation, this is crt1 for WASI command:

void _start(void) {
  if (__wasm_call_ctors) {
    __wasm_call_ctors();
  }

  /*
   * Will either end up calling the user's original zero argument main directly
   * or our __original_main fallback in __original_main.c which handles
   * populating argv.
   */
  int r = __original_main();

  exit(r);
}

and this is crt1 for WASI reactor:

void _initialize(void) {
  if (__wasm_call_ctors) {
    __wasm_call_ctors();
  }
}

Ah - one more thing: maybe we would call main in WASI reactors' _initialize without exiting afterwards. That means

WASI commands would look like
fn _start() {
  init_heap();
  global_constructors();
  retrun_value = main();
  global_destructors();
  proc_exit(return_value);
}
WASI reactors would be like
fn _initialize() {
  init_heap();
  global_constructors();
  main(); // not exit afterwards
}

No, no, no! WASI reactors have multiple entrypoints, but none is called as part of initialization (that's the whole difference between commands and reactors), so we don't want to call main() from within _initialize().

What we do in this PR is that we call main() (if it's exported by the plugin) after WASI reactor is initialized.

and then this PR would have no affect on TinyGo. This is kind of a design decision in TinyGo so I am not sure how this would look like when the WASI reactor support lands.

This PR doesn't affect TinyGo at this point.

Currently, TinyGo creates WASI command (using _start function), and this PR doesn't change that code path at all.

If TinyGo adds support for WASI reactors (using _initialize function), then the result of this PR is that existing plugins will work without any changes, because we're going to call main() (if it's exported by the plugin) after WASI rector is initialized. Without this PR, they would stop working, since you're hooking into SDK in the main() function.

mathetake · 2021-03-02T09:10:26Z

WASI reactors have multiple entrypoints, but none is called as part of initialization (that's the whole difference between commands and reactors), so we don't want to call main() from within _initialize().

OK, finally everything looks clear to me, and thanks for bearing with me. (And I am now thinking that I should have depended on func init(), not on func main() for plugin initialization)

PiotrSikora requested a review from mathetake as a code owner March 1, 2021 08:27

mathetake approved these changes Mar 2, 2021

View reviewed changes

PiotrSikora merged commit bc6fe15 into proxy-wasm:master Mar 2, 2021

This was referenced Apr 19, 2021

Add notice to wasm docs kgateway-dev/kgateway#4617

Merged

Add C++ WASM filter backward compatibility for 1.7 kgateway-dev/kgateway#4618

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optionally call main() in WASI reactors as a convenience. #133

Optionally call main() in WASI reactors as a convenience. #133

Uh oh!

PiotrSikora commented Mar 1, 2021

Uh oh!

PiotrSikora commented Mar 1, 2021

Uh oh!

PiotrSikora commented Mar 1, 2021

Uh oh!

mathetake commented Mar 2, 2021 •

edited

Loading

Uh oh!

mathetake commented Mar 2, 2021

Uh oh!

PiotrSikora commented Mar 2, 2021

Uh oh!

mathetake commented Mar 2, 2021

Uh oh!

mathetake commented Mar 2, 2021 •

edited

Loading

Uh oh!

PiotrSikora commented Mar 2, 2021

Uh oh!

mathetake commented Mar 2, 2021

Uh oh!

Uh oh!

Optionally call main() in WASI reactors as a convenience. #133

Optionally call main() in WASI reactors as a convenience. #133

Uh oh!

Conversation

PiotrSikora commented Mar 1, 2021

Uh oh!

PiotrSikora commented Mar 1, 2021

Uh oh!

PiotrSikora commented Mar 1, 2021

Uh oh!

mathetake commented Mar 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mathetake commented Mar 2, 2021

Uh oh!

PiotrSikora commented Mar 2, 2021

Uh oh!

mathetake commented Mar 2, 2021

Uh oh!

mathetake commented Mar 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PiotrSikora commented Mar 2, 2021

Uh oh!

mathetake commented Mar 2, 2021

Uh oh!

Uh oh!

mathetake commented Mar 2, 2021 •

edited

Loading

mathetake commented Mar 2, 2021 •

edited

Loading