Rewrite of the code generation script. #26

Merged
merged 44 commits on May 2, 2019

Conversation

Contributor

@eaplatanios eaplatanios commented Apr 23, 2019

@rxwei @pschuh This is an attempt to rewrite the code generation script so that it supports the following features:

  1. There is a new flag called mode that you can set to either tfop, eager, or tfop-eager-fallback; it allows you to generate ops using either the #tfop operator or the eager-mode C API (based on @pschuh's previous implementation). tfop-eager-fallback uses tfop wherever possible and eager otherwise (e.g., for output lists).
  2. VariantHandle and ResourceHandle are now also supported, allowing us to replace many of the uses of #tfop in stdlib with calls to Raw functions.
  3. Output tensor lists are now supported when using eager mode.
  4. Function-valued attributes are now also supported in both tfop and eager modes. _tffunc is used to trace them.
  5. Added support for ops that can take either Tensor<T> or StringTensor. In this case, two functions are generated, one for each type.
  6. The code has been reorganized so that the tfop and eager modes share as much code as possible, and the generated API remains the same no matter which mode is used (stdlib can be compiled with bindings generated in either mode without any changes); a sketch of a generated eager-mode binding follows this list.
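
To make item 6 concrete, here is a hedged sketch of the kind of eager-mode binding the script might emit for a simple op such as Identity. The TFE_Op wrapper mirrors the restoreV2 example quoted later in this conversation; the exact constraint names and attribute-setting calls are assumptions, not the script's verbatim output.

// Hedged sketch only: the shape of an eager-mode binding the script might emit.
// The TFE_Op/addInput/setAttr/execute calls mirror the restoreV2 example later
// in this thread; the setAttr overload and constraint names are assumptions.
@inlinable @inline(__always)
public static func identity<T: TensorFlowScalar>(
  _ input: Tensor<T>
) -> Tensor<T> {
  let op = TFE_Op("Identity")
  let _ = op.addInput(input)
  op.setAttr("T", T.tensorFlowDataType)
  // Tensor<T> conforms to TensorGroup, so execute can unpack the single output.
  return op.execute(Int(Tensor<T>._typeList.count))
}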

Only list(func), ref-valued, and complex-valued types are not supported by the script now, which should be fine as they're not that common. The script now covers 1096/1286 ops. Of the remaining 190 ops, 131 are ref-valued, so this covers almost everything.

This also helps with cleaning up stdlib and transitioning functionality over to swift-apis.

Friend PRs: swiftlang/swift#24261 and tensorflow/swift-apis#109 .

@rxwei rxwei requested review from pschuh, rxwei and bgogul April 23, 2019 03:35
@eaplatanios
Contributor Author

@rxwei So, if we add the constructor TensorArrayProtocol.init(_owning:count:) then I can add support for output tensor arrays by using TensorArrayProtocol for the output type in these cases.

@rxwei
Contributor

rxwei commented Apr 23, 2019

@rxwei So, if we add the constructor TensorArrayProtocol.init(_owning:count:) then I can add support for output tensor arrays by using TensorArrayProtocol for the output type in these cases.

That makes sense to me!

@eaplatanios
Contributor Author

That makes sense to me!

Cool, I'll go ahead and implement that as an experiment to see if it works out.

@eaplatanios
Contributor Author

eaplatanios commented Apr 23, 2019

@rxwei Actually this makes a lot of sense and it revealed something interesting. The Array conformance should now look something like this:

extension Array : TensorArrayProtocol where Element : TensorGroup {
  public func _unpackTensorHandles(into address: UnsafeMutablePointer<CTensorHandle>?) {
    var ptr = address
    for elem in self {
      elem._unpackTensorHandles(into: ptr)
      ptr = ptr!.advanced(by: Int(elem._tensorHandleCount))
    }
  }

  public var _tensorHandleCount: Int32 {
    var count: Int32 = 0
    for elem in self { count += elem._tensorHandleCount }
    return count
  }

  public init(_owning tensorHandles: UnsafePointer<CTensorHandle>?, count: Int) {
    // Each Element owns a fixed number of handles, so the element count is
    // count / Element._tensorHandleCount.
    let size = count / Int(Element._tensorHandleCount)
    self = Array((0..<size).map { Element(
      _owning: tensorHandles?.advanced(by: $0 * Int(Element._tensorHandleCount)))
    })
  }
}

That is, Element has to conform to TensorGroup. I don't see how TensorArrayProtocol alone, without the count-aware initializer, is useful in any setting. For example, with the previous extension Array : TensorArrayProtocol where Element : TensorArrayProtocol, you can never construct the array without knowing the count, which makes it not very useful. Is it OK if I switch to the above implementation? Then I can easily add support for output lists in the bindings.
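
As a hedged usage sketch of why the count parameter matters (assuming the Array conformance above and that Tensor<Float> conforms to TensorGroup): with Element == Tensor<Float>, _tensorHandleCount is 1, so count handles reconstruct count tensors; without count, Array has no way to know how many elements to produce.

// Hedged usage sketch only; relies on the Array conformance sketched above.
func unpackFloatTensors(
  from handles: UnsafePointer<CTensorHandle>?,
  count: Int
) -> [Tensor<Float>] {
  // Calls the Array(_owning:count:) initializer from the conformance above.
  return [Tensor<Float>](_owning: handles, count: count)
}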

@rxwei
Contributor

rxwei commented Apr 23, 2019

Actually this makes a lot of sense and it revealed something interesting. It doesn't really make sense to have automatic derivations for TensorArrayProtocol.

Makes sense, though we had to derive TensorArrayProtocol requirements because TensorGroup refines TensorArrayProtocol. Adding an init(_owning:count:) sounds like breaking this inheritance relation apart. Do you feel this inheritance is still necessary? If so, what does init(_owning:count:) mean for a TensorGroup, which is supposed to have a fixed number of elements?

@eaplatanios
Contributor Author

In this case TensorGroup has a default implementation:

init(_owning tensorHandles: UnsafePointer<CTensorHandle>?, count: Int) {
  precondition(count == Int(Self._tensorHandleCount))
  self.init(_owning: tensorHandles)
}

@rxwei
Contributor

rxwei commented Apr 23, 2019

Yeah, makes sense. It would be very interesting to prototype this. I'm also curious to hear what @pschuh and @marcrasi think.

@rxwei
Contributor

rxwei commented Apr 23, 2019

Also, a random idea about generating boilerplate: since each of these ops calls the same TFE functions, would it make sense to define a tfop Swift function that dispatches an op and its arguments for us? Yeah, we will need variadic generics, but we can overload tfop up to a certain arity for now. This would remove a ton of code duplication and make optimizations easier.
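
To sketch the idea (a hedged, illustrative sketch only: the helper name, the fixed arity, and reusing the TFE_Op wrapper quoted later in this thread are assumptions, not a settled design):

// Illustrative sketch of the overload-to-a-fixed-arity idea. The helper name
// and the assumption that addInput accepts any TensorGroup are illustrative;
// the TFE_Op wrapper is the one quoted later in this thread. A two-input
// overload would repeat the pattern with one more addInput call.
@inlinable @inline(__always)
func dispatchOp<In0: TensorGroup, Out: TensorGroup>(
  _ name: String,
  _ input0: In0,
  typeListAttrs: [String: [TensorDataType]] = [:]
) -> Out {
  let op = TFE_Op(name)
  let _ = op.addInput(input0)
  for (attrName, typeList) in typeListAttrs {
    op.setAttr(attrName, typeList)
  }
  return op.execute(Int(Out._typeList.count))
}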

@eaplatanios
Contributor Author

Yes that would be great. I was hoping that we could make the current #tfop handle that and that's why I added the mode flag to support it. However, we would need to sort out the bug that causes the test failures when using #tfop and compiling with optimizations.

@rxwei
Contributor

rxwei commented Apr 23, 2019

#tfop is actually causing a lot of problems in our infrastructure because it's not maintained, and we need to deal with it often when we merge from master. Personally, I think it can be designed as a Builtin function, but that probably wouldn't work well with const attributes. @lattner what do you think?

@eaplatanios
Contributor Author

@rxwei I just pushed a change that implements this. The only changes required to compile this in stdlib are:

  • Adding init(_owning tensorHandles: UnsafePointer<CTensorHandle>?, count: Int) to TensorArrayProtocol.
  • Adding the following default implementation in a TensorGroup extension:
init(_owning tensorHandles: UnsafePointer<CTensorHandle>?, count: Int) {
  precondition(count == Int(Self._tensorHandleCount))
  self.init(_owning: tensorHandles)
}
  • Changing the Array conformance to TensorArrayProtocol to this:
extension Array : TensorArrayProtocol where Element : TensorGroup {
  public func _unpackTensorHandles(into address: UnsafeMutablePointer<CTensorHandle>?) {
    var ptr = address
    for elem in self {
      elem._unpackTensorHandles(into: ptr)
      ptr = ptr!.advanced(by: Int(elem._tensorHandleCount))
    }
  }

  public var _tensorHandleCount: Int32 {
    var count: Int32 = 0
    for elem in self { count += elem._tensorHandleCount }
    return count
  }

  public init(_owning tensorHandles: UnsafePointer<CTensorHandle>?, count: Int) {
    // Each Element owns a fixed number of handles, so the element count is
    // count / Element._tensorHandleCount.
    let size = count / Int(Element._tensorHandleCount)
    self = Array((0..<size).map { Element(
      _owning: tensorHandles?.advanced(by: $0 * Int(Element._tensorHandleCount)))
    })
  }
}

Regarding the Builtin function, I agree. It's currently nearly impossible for me to debug issues related to #tfop because it's so opaque about what's happening underneath.

@rxwei
Contributor

rxwei commented Apr 23, 2019

Nice!

@eaplatanios
Contributor Author

eaplatanios commented Apr 23, 2019

There are still some issues to sort out with respect to the data types of output lists, but I'll look into that tomorrow.

…handling of input tensor lists for 'eager' mode.
@eaplatanios
Contributor Author

This PR should be ready for review along with swiftlang/swift#24229 . All tests pass locally as the changes are all backwards compatible.

@dan-zheng
Member

This PR should be ready for review along with apple/swift#24229 . All tests pass locally as the changes are all backwards compatible.

Could you please fix the merge conflict in RawOpsGenerated.swift?

@eaplatanios
Contributor Author

Could you please fix the merge conflict in RawOpsGenerated.swift?

Done! :)

@eaplatanios
Contributor Author

@rxwei This should be ready for review as it is backwards compatible and should not break anything in the existing codebase.

@rxwei
Contributor

rxwei commented Apr 27, 2019

Great. I'd get @pschuh's opinions on this first as I'm less familiar with binding generation.

Contributor

@pschuh pschuh left a comment

Just some minor comments.

return self.op.inferred_counts[number_attr]
if number_attr:
return self.swift_name + 'Count'
if self.arg_def.type_list_attr:
Contributor

Can't comment on the raw ops, but these appear to be codegenning as #tfops. This is breaking "saveV2" and "restoreV2" because they no longer return anything. I think the original logic was bad here. They should probably return [AnyTensor] or be otherwise blacklisted.

Contributor Author

It depends on the mode you use. I currently set it to tfop-eager-fallback just for backwards compatibility, but the signature should be the same in eager mode. I don't understand, though, what is wrong with the following two ops: saveV2 does not return anything, as expected, and restoreV2 returns a value of type Dtypes that conforms to TensorGroup. This lets you save and restore, say, a struct of tensors and avoids the loss of type information incurred by using [AnyTensor]. Maybe I am missing something, though.

@inlinable @inline(__always)
public static func saveV2<Dtypes: TensorArrayProtocol>(
  prefix: StringTensor,
  tensorNames: StringTensor,
  shapeAndSlices: StringTensor,
  tensors: Dtypes
) {
  return #tfop("SaveV2",
    prefix,
    tensorNames,
    shapeAndSlices,
    tensors,
    dtypes$dtype: tensors._typeList)
}

@inlinable @inline(__always)
public static func restoreV2<Dtypes: TensorGroup>(
  prefix: StringTensor,
  tensorNames: StringTensor,
  shapeAndSlices: StringTensor
) -> Dtypes {
  let op = TFE_Op("RestoreV2")
  let _ = op.addInput(prefix)
  let _ = op.addInput(tensorNames)
  let _ = op.addInput(shapeAndSlices)
  op.setAttr("dtypes", Dtypes._typeList)
  return op.execute(Int(Dtypes._typeList.count))
}

Contributor

My bad, it was just restoreV2 that I had a problem with. You're constraining _typeList to be a static value. This is not useful in the plan that I have. I'll fix it later I guess, but I would prefer disabling it.

Contributor Author

Actually, that was a debate I had. I ended up with somewhat of a middle ground: if the type whose _typeList property we want appears only as an output arg, we constrain it to be a TensorGroup and use a static property; otherwise, we constrain it to be a TensorArrayProtocol and use an instance property. The point is that when it's an output arg we need to unpack the tensor handles in either case, and that's something TensorGroup allows us to do. We could disable it for now, but I'm curious which use case it doesn't work for so we can try to think of a better way to do it.

Contributor

I want to determine the type list at runtime. In this case, I will be serializing and deserializing a dynamic list of tensors.

Contributor Author

That sounds like a special use case that's not easy to generalize in the raw ops generation. Given that the current generation script produces somewhat type-safe code, how about we add an untyped overload for restoreV2 that offers the functionality you need?
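
Purely as a hypothetical sketch of what such an untyped overload could look like (the executeUnowned(outputCount:) helper is an assumption and is not part of the TFE_Op wrapper shown earlier in this thread):

// Hypothetical sketch only: the dtype list is supplied at runtime rather than
// derived from a static TensorGroup. executeUnowned(outputCount:) is an
// assumed helper that would return raw CTensorHandles.
@inlinable @inline(__always)
public static func restoreV2(
  prefix: StringTensor,
  tensorNames: StringTensor,
  shapeAndSlices: StringTensor,
  dtypes: [TensorDataType]
) -> [CTensorHandle] {
  let op = TFE_Op("RestoreV2")
  let _ = op.addInput(prefix)
  let _ = op.addInput(tensorNames)
  let _ = op.addInput(shapeAndSlices)
  op.setAttr("dtypes", dtypes)
  return op.executeUnowned(outputCount: dtypes.count)
}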

rxwei pushed a commit to swiftlang/swift that referenced this pull request Apr 27, 2019
* Changed 'TensorArrayProtocol' such that it can be used to support output tensor arrays in raw ops.

* Added a '_typeList' property to 'TensorArrayProtocol'.

Friend PR: tensorflow/swift-bindings#26 .
@eaplatanios
Contributor Author

@rxwei @pschuh @saeta This is now using eager mode by default, as we discussed. I realized I was actually using _TFCEagerExecute so no changes required there.

@eaplatanios
Contributor Author

@pschuh I made the couple of fixes you suggested. I'm currently trying to build and test swiftlang/swift#24425 with this version of the bindings.

@eaplatanios
Contributor Author

@pschuh I fixed the bug with the number attributes, but I still had to disable check(status) for addInputList because it was still failing for some reason, even though I set the number attribute before calling addInputList. I think this may be a bug in the C API, and we can ignore it for now since we are now inferring the number attribute on our own anyway.

I can also confirm that all tests pass on my machine now.

@eaplatanios
Contributor Author

I also just removed support for the tfop mode to keep things cleaner.

@pschuh pschuh merged commit ee5ded2 into tensorflow:master May 2, 2019