
high-level tcp bindings for std #2409


Closed

olsonjeffery wants to merge 30 commits

Conversation

olsonjeffery
Contributor

quick summary

This pull request includes high-level TCP/IP bindings for the rust stdlib (both server and client APIs), under the std::net::tcp module. I also did some reshuffling of the IP code and pushed it into the net::ip module.

also: added result::unwrap (from a patch by @nmatsakis) and ignore'd a test in std::timer that is starting to fail more and more, as perf chokes due to valgrind and threading behavior (the test is time-sensitive, so it was probably poorly conceived to begin with)
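For reference, result::unwrap is the usual move-out-or-fail helper; a minimal sketch in the era's syntax (the actual patch may differ, e.g. by avoiding the copy):

    // minimal sketch of result::unwrap; shape assumed, not the exact patch
    fn unwrap<T: copy, U>(res: result::result<T, U>) -> T {
        alt res {
          result::ok(t) { t }
          result::err(_) { fail "unwrap called on an err result"; }
        }
    }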

only slightly more in-depth

An example demonstrating the TCP client/request API can be found here.

Interestingly, because of API-style friction between libuv and rust, I found it necessary to expose two separate interfaces to the TCP server API. Quickly, with examples, they are:

  • the net::tcp::listen_for_conn entry point, an example of which is here. This call blocks for the lifetime of the TCP server connection and uses a callback running on the libuv loop (!).
  • and the net::tcp::new_listener entry point, an example of which can be found here. This bit of API, on successful setup, returns a (vaguely) comm::port-like object that users can peek/recv on (using custom API), as needed, to get new connections.

The new_listener API differs subtly from the listen_for_conn entry point in that it accepts new connections immediately (with the overhead that this entails), while listen_for_conn gives the user the chance to manually accept a connection, or just drop it, in line with the default behavior of libuv.
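For concreteness, a hypothetical usage sketch of the callback-style entry point; the parameter list and callback signature are assumptions based on the description above, not the PR's exact API:

    // hypothetical sketch -- the cb runs on the libuv thread (!)
    net::tcp::listen_for_conn(ip, port, backlog) {|new_conn, kill_ch|
        // the accept-or-drop decision must happen before this cb returns,
        // since uv_accept is only valid inside the on_connect_cb window
        alt net::tcp::accept(new_conn) {
          result::ok(socket) {
            // hand the tcp_socket off to a task; note that blocking
            // here blocks the libuv loop itself
          }
          result::err(e) { comm::send(kill_ch, some(e)); }
        }
    };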

To more fully articulate this, I'm just going to paste my commit msg from 95f0b90:

  • we now have two interfaces for the TCP/IP server/listener workflow,
    based on different user approaches to how to deal with the flow of
    accepting a new tcp connection:

  1. the "original" API closely mimics the low-level libuv API, in that we
    have an on_connect_cb that the user provides *that is run on the libuv
    thread*. In this callback, the user can accept() a connection, turning it
    into a tcp_socket.. of course, before accepting, they have the option
    of passing it to a new task, provided they *make the cb block until
    the accept is done* .. this is because, in libuv, you have to do the
    uv_accept call in the span of that on_connect_cb callback that gets fired
    when a new connection comes in. thems the breaks..

I wanted to just get rid of this API, because the general proposition of
users always running code on the libuv thread sounds like an invitation
for many future headaches. the API restriction of having to choose to
immediately accept a connection (and allow the user to block libuv as
needed) isn't too bad for power users who could conceive of circumstances
where they would drop an incoming TCP connection and know what they're
doing, in general.

but as a general API, I thought this was a bit cumbersome, so I ended up
devising..

  2. an API that is initiated with a call to `net::tcp::new_listener()` ..
    has a similar signature to `net::tcp::listen()`, except that it just
    returns an object that sort of behaves like a `comm::port`. Users can
    block on the `tcp_conn_port` to receive new connections, either in the
    current task or in a new task, depending on which API route they take
    (`net::tcp::conn_recv` or `net::tcp::conn_recv_spawn` respectively).. there
    is also a `net::tcp::conn_peek` function that will do a peek on the
    underlying port to see if there are pending connections.

The main difference, with this API, is that the low-level libuv glue is
going to accept every connection attempt, along with the overhead that
that brings. But this is a much more hassle-free API for 95% of use
cases and will probably be the one that most users will want to reach for.
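A corresponding rough sketch of that port-like flow (the signatures here are guesses from the names above, not the PR's actual API):

    // hypothetical sketch of the port-like style
    let conn_port = result::unwrap(net::tcp::new_listener(ip, port, backlog));
    loop {
        // conn_peek checks for pending connections without blocking
        if net::tcp::conn_peek(conn_port) {
            // handle each already-accepted connection on a fresh task
            net::tcp::conn_recv_spawn(conn_port) {|socket|
                // client I/O against the tcp_socket happens here
            };
        }
    }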

what's needed to fill this out

  1. IPv6! I stubbed out the data structure, but parsing IPv6 addr strings is non-trivial, so I'm not sure how to tackle this from rust, just yet. If anyone wants to pitch in, that'd be swell.
  2. The other bits of the tcp API. I only added connection establishment for clients/servers and read/write over TCP streams.. so stuff like setting keep-alive, etc. needs to be added.

    let server_data_ptr = uv::ll::get_data_for_uv_handle(handle)
        as *tcp_listen_fc_data;
    let kill_ch = (*server_data_ptr).kill_ch;
    alt (*server_data_ptr).active {

How about `if` here?
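Spelled out, the suggestion is presumably this shape, since active is a bool (sketch based on the snippet above):

    // a plain if/else instead of alt on a bool
    if (*server_data_ptr).active {
        // server still live: process the incoming connection
    } else {
        // shutting down: signal on kill_ch instead
    }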


brson commented May 19, 2012

How safe is conn_recv_spawn? Do you have to ensure the server_port outlives the task handling the connection?

@olsonjeffery
Contributor Author

@brson ok, so once the new_conn_po in conn_recv_spawn recvs, we have a valid stream for the client. We then pass that valid stream into a new task, where a tcp_socket is created and then passed to the user-supplied callback. At this point, conn_recv_spawn returns.. So if the server_port goes out of scope and tears down during/before the body of the callback is run, then operations against the client stream (contained in the tcp_socket) will error out.

Ideally, the user would keep the server_port bottled up in a loop or somesuch, so this wouldn't be an issue (see the sketch below). But if it does happen, it won't be a catastrophic failure/segfault, since the stream contained in the tcp_socket is a different one from the stream contained in the server_port/tcp_conn_port (the client stream is created in tcp_nl_on_connection_cb L884, with the call to malloc_uv_tcp_t)
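In other words, the intended pattern keeps the listener port alive for the server's whole lifetime; a hedged sketch (same assumed signatures as above, still_accepting is a hypothetical app-level check):

    // keep server_port in scope so tasks spawned via conn_recv_spawn
    // never outlive it
    let server_port = result::unwrap(net::tcp::new_listener(ip, port, backlog));
    let mut running = true;
    while running {
        net::tcp::conn_recv_spawn(server_port) {|socket|
            // per-connection work happens here, in its own task
        };
        running = still_accepting(); // hypothetical shutdown check
    }
    // server_port only goes out of scope here, after the loop --
    // not while a spawned handler might still be using it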

* tweaked the layout of sockaddr_in6 struct in anticipation of future use
* changed several uv::ll fn signatures to use generics and be more flexible
with the ptr types they get passed
* add uv_err_data and a helper fn to return it.. packages up err_name and
err_msg info from the uv_get_last_error() stuff..
they're changed into a net::tcp::tcp_err_data record, for now. once the
scope of possible tcp errors, from libuv, is established I'll create an
err type for each one and return those where they might occur
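Presumably the interim record just carries those two strings; a sketch (shape inferred from the err_name/err_msg mention above):

    // interim error record until per-error types exist (shape inferred)
    type tcp_err_data = {
        err_name: str, // short libuv error name
        err_msg: str   // human-readable message from uv_get_last_error()
    };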
still need implementation for parsing/output formatting and (perhaps?)
representation (for now, i just followed the ipv4 variant's lead and
am representing it as a tuple of 8x u16).

parsing an ipv6 addr is way more complex than parsing an ipv4 addr, so
i'm putting off an implementation here, for now.

candidate solutions:
- could use getaddrinfo() (exists on both POSIX and windows), but with
incompatible fn signatures.
- libuv has a way to parse an ipv6 string into
a sockaddr_in6, but it also requires a port, so it's probably not appropriate
for ip_addr
also whitespace cleanup
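Based on the representation note above, the stubbed type would look something like this (the ipv4 shape is an assumption; only the 8x u16 ipv6 layout is stated):

    // sketch of the ip_addr variants described above
    enum ip_addr {
        ipv4(u8, u8, u8, u8),                        // assumed existing shape
        ipv6(u16, u16, u16, u16, u16, u16, u16, u16) // 8x u16, per the commit
    }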

.. for now, the test just spins up the server and listens for messages,
echoing them back to an output port. there's a "kill" msg that it will
listen for. need to point the tcp client and server test impls at each
other for a loopback server/client test, like how it's done in uv::ll

once ipv6 parse/format lands, i can add another test using entirely the
same codebase, but substituting an ip_addr ipv6 variant for the ipv4
variant used in the existing code

still need some other plumbing to get the client/server tests to work
together.
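The test loop described above would have roughly this shape (the read/write names, the timeout argument, and output_ch are assumptions, not the test's exact code):

    // sketch: echo each msg to an output port, stop on the "kill" msg
    let mut done = false;
    while !done {
        alt net::tcp::read(sock, 0u) {
          result::ok(data) {
            let msg = str::from_bytes(data);
            comm::send(output_ch, msg); // echo to the output port
            if msg == "kill" { done = true; }
          }
          result::err(_) { done = true; }
        }
    }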
.. going to rework the listen() API to be non-blocking.
I need these in the context of doing various malloc/free operations for
libuv structs that need to live in the heap, because of API workflow
(there's no stack to put them in). This has cropped up several times
when impl'ing the high-level API for things like timers, but I've decided
to take the plunge and use this approach for the net::tcp module.

Technically, this can be avoided by spawning a new
task that contains the needed memory structures on its stack and then
having it block for the duration of the time we need that memory to be
valid (this is what I did in std::timer). Exposing this API provides a
much lower overhead way to address
the issue, albeit with safety concerns. The main mitigation policy should
be to use malloc/free with libuv handles only when the handles are then
associated with a resource or class-with-dtor. So we have a finite lifetime
for the object and can guarantee a free(), barring a runtime crash (in
which case you have bigger problems!)
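That mitigation is the classic resource-with-dtor pattern; a sketch (the free helper and handle type names are assumptions; malloc_uv_tcp_t is mentioned elsewhere in this thread):

    // tie a malloc'd libuv handle to a resource so the free() is
    // guaranteed when the owner drops, barring a runtime crash
    resource tcp_handle(handle: *uv::ll::uv_tcp_t) {
        unsafe { uv::ll::free_uv_tcp_t(handle); } // assumed helper name
    }

    fn demo() unsafe {
        let h = tcp_handle(uv::ll::malloc_uv_tcp_t());
        // use *h with uv::ll calls; freed automatically when h drops
    }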
.. turns out that, without the export, the modules aren't accessible
outside of the crate itself. I thought that, by importing some module
into another (nesting it) and exporting from that nested module (which
is, itself, exported from std.rc), my mod would be in the build
artifact. This doesn't appear to be the case. learning is fun!
.. this test fails frequently, locally, when run with the batch of other
global_loop tests running due to how valgrind deals with multithreading
in the test app. not sure what to do, here.
* there are a few places where I was experimenting w/ using `alt` in places
where `if`/`else` would've sufficed. don't drink the koolaid!
* I had an unneeded `else` structure (the `if` branch that preceded it
concluded with a `fail` statement.. I added the `fail` later in the dev
cycle for this branch, so I forgot to remove the `else` after doing so)
* made `prop_name: value` vs. `prop_name : value` consistent in record decls
and initialization
* change an `alt` exp on an `ip_addr` to actually be exhaustive,
instead of using a catch-all clause
… write

also: read_future ala write_future .. woooooohooooooooo
…them

- change port of tcp server test in uv_ll to avoid conflict w/ test in
net::tcp
- in a few places the tcp::read fn is used in tests w/ a timeout.. suspend
use of the timeout from here on out.
@brson
Copy link
Contributor

brson commented May 23, 2012

Merged!

Thanks for this huge amount of work and for writing such a thorough description in the pull request.

I've read a fair bit of the code now and I am happy with the direction we're going. The overall design seems to get a little clearer every iteration. I have yet to take a close look at the final tcp interface, though I do intend to. Integrating it with servo might help me get a better feel for where we are - this week I will try to do some experiments with it.

Some miscellaneous things we should consider for the future:

  • We need to figure out the right abstractions to make the high level APIs memory-safe. Atomic ref counted types will help.
  • The safe APIs need clearly defined failure modes with accompanying tests. Using the high-level API it ultimately shouldn't be possible to break the uv loop, even when any client tasks fail.
  • At some point we need to think about how rust's I/O APIs (streams in particular) should look and how uv fits in.

I am going to leave this pull request open a while longer because I want to come back to it again.

@olsonjeffery
Contributor Author

thanks for fixing timer, with the send/copy kind change. I did a fix last night, but was running tests overnight, so you beat me to it.

100% agreement on all points. we're getting there.

brson closed this May 27, 2012