Artificial Intelligence Contracts: An Issue-Spotter – Part 2 of 3

This is the second of a three-post issue-spotter about contracts related to artificial intelligence, with a particular focus on gen-AI. Click here for the first of the series.

[Image: frustrated employee in front of computer]
Prompt: Write software to automate the procedure above.
Output: I want an IP lawyer.

B. Ownership and Control of Outputs

  1. General Problems with “Ownership” of Outputs: AI systems produce data as outputs, and ownership of data is problematic, as bullet 1 explains (in the last post). The same goes for ownership of the more “creative” outputs from generative AI, but for different reasons. The law will not grant a copyright or patent for works created by an AI or any other non-human. The law does offer copyright and patent protection to humans who use AI to supplement their own creativity. But as of today, no one knows how much the AI can contribute before the human loses those protections. Ownership of outputs, then, could have some value for copyrights and patents, but it’s hard to say how much. Fortunately, AI can offer more certain contributions to its users’ trade secret and trademark rights, since those rights don’t need human creators. So contract terms about ownership might be worthwhile.
  2. Customer Ownership of Outputs: Usually, it’s the customer who wants ownership, particularly of gen-AI outputs. “Provider recognizes and agrees that Customer owns and will own Outputs, including without limitation any intellectual property rights therein, except as set forth ___.” (See bullet 9 below for the “except” language.) And the customer might want more: an assignment of any provider IP in outputs. But what’s to assign? The customer or its staff might have patent rights or copyrights in outputs, assuming an adequate human contribution, as discussed in bullet 6. The provider, on the other hand, may have nothing to assign. Its contribution almost certainly won’t involve human work, so the provider could not have patent rights or copyrights related to outputs. And unless the gen-AI processes the provider’s confidential data – as discussed below in bullet 16 – the provider wouldn’t own trade secret rights. Nor would it own trademarks it hasn’t registered or used in commerce. Any assignment terms, then, would result from a serious abundance of caution on the customer’s part. Still, you never know – and it’s at least possible that outputs could include provider content from the training data. So: “Provider hereby assigns to Customer all its right, title, and interest in and to Outputs, including without limitation intellectual property rights, except as set forth ___.” (For a discussion of assignment and for more detailed terms, see Chapter 1.F of The Tech Contracts Handbook, as well as the book’s assignment terms reproduced here on TechContracts.com: 1.F.1. limited ownership transfer and 1.F.2. broad ownership transfer. Again, see bullet 9 for the “except” language.)
  3. Provider Concerns – Refusing Output Ownership: Where possible, the AI provider should resist terms assigning outputs, or even terms accepting customer ownership. In many systems (not just generative AI), outputs could include data or content owned or claimed by third parties or by the AI provider itself. The safest route for the provider, then, may be to say nothing about IP, leaving ownership to the underlying laws. Those laws might give the customer ownership, but not of anything that already has an owner. The provider should also consider disclaimers related to IP. “PROVIDER OFFERS NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, RELATED TO INTELLECTUAL PROPERTY OR OTHER RIGHTS IN OUTPUTS, AND CUSTOMER USES OUTPUTS AT ITS OWN RISK WITH REGARD TO ALL SUCH RIGHTS.” But see bullet 9 below.
  4. Provider Concerns – Limiting Output Ownership and Retaining Rights: In some deal-types, the customer can and does insist on owning outputs. But the provider should still try to limit ownership terms to an acknowledgement, like the first example above in bullet 7 – with no assignment. And with or without an assignment, the provider should add carve-outs. “The Customer ownership rights and the assignment in Section __ (Ownership and Assignment of Outputs) do not apply to Independent Assets.” (If there’s no assignment, delete “and the assignment.”) With some systems, the provider can offer a clear list of the data or content included in these “Independent Assets.” The provider knows the input data or training data the gen-AI processes, so it knows what could appear in outputs. But the provider can’t offer much certainty about Independent Assets if the AI draws on very broad or hard-to-define data, like the Internet-wide training data used by some gen-AI. “‘Independent Asset’ means any data or content created before or independent of the prompt that led the System to generate the Output in question, including without limitation any such data or content included in Outputs.” Unfortunately, the customer almost certainly won’t know what parts of outputs were created “before or independent of” its prompt. So the example above doesn’t tell it what components of the outputs it owns. The customer might have to accept that as the price of complex AI, though it should at least get a license, per bullet 10 below.
  5. License to Outputs: The customer probably doesn’t need a license to outputs, even if it doesn’t own them. The provider probably has no IP to license, as explained above in bullets 6 and 8. And if the provider did have IP, a license would probably be implied. But the abundance of caution that demands an assignment could demand a license too, particularly if the provider refuses ownership terms. “Provider hereby grants Customer a nonexclusive, perpetual, worldwide license to reproduce, distribute, modify, publicly perform, publicly display, and use Outputs, with the right to sublicense each and every such right. Provider grants the license in the preceding sentence under such copyrights as it may have, if any.” (Those terms could also apply to the Independent Assets discussed above in bullet 9.) Finally, the provider/licensor should consider disclaiming IP warranties related to outputs, using the language in bullet 8.
  6. Use of Outputs – Restrictions and Rights for Each Party: As we’ve seen, IP ownership might not give the customer much protection related to outputs, even where that ownership holds up. So if the provider has access to outputs, the customer should consider restricting provider use of outputs. “Provider shall not: (a) publicize or distribute Outputs; or (b) access or use Outputs other than to maintain or improve the System or to support Customer’s use of the System. The preceding sentence does not restrict Provider’s rights to information from independent sources that also appears in Outputs.” Of course, if the provider uses outputs to train the system for other customers’ benefit (which isn’t likely), it probably can’t accept these or other use restrictions. Finally, the customer should consider protecting outputs through nondisclosure terms.

C. Trade Secrets and Other Confidential Information Related to AI

[Image caption: “Don’t worry. You can trust me.”]

  1. Typical Prompts and Customer Training Data & Input Data – Nondisclosure Protecting Customer: If prompts or customer data could include sensitive information, the customer should consider nondisclosure provisions. The “Confidential Information” definition in standard NDA terms would include prompts, customer input data, and customer training/fine-tuning data. See Chapter II.H of The Tech Contracts Handbook and the confidentiality terms on this site at II-H: Nondisclosure/Confidentiality. The provider can probably accept those terms if its gen-AI makes no future use of prompts or customer training data or if it uses them only to train the customer’s copy. But the provider needs the typical exceptions to nondisclosure, which protect the recipient’s rights to information received from a third party or generated independently. See Confidential Information Defined, on this site.
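    For example, a minimal sketch of such a definition (the defined terms “Prompts,” “Customer Input Data,” and “Customer Training Data” are illustrative placeholders, not drawn from the standard terms, and should be conformed to the contract’s own definitions): “Customer’s ‘Confidential Information’ includes, without limitation, all Prompts, Customer Input Data, and Customer Training Data submitted to the System or otherwise provided to Provider pursuant to this Agreement.”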

  2. Prompts and Customer Training Data Used to Train Provider Products – No Confidentiality, No Trade Secrets: If the system uses prompts or customer training data to train gen-AI or other machine learning for all customers’ use, the provider should not accept confidentiality terms protecting that information. In fact, it should consider disclaimers and warnings. “CUSTOMER RECOGNIZES AND AGREES THAT: (a) PROMPTS AND CUSTOMER TRAINING DATA ARE NOT CONFIDENTIAL AND MAY BE USED TO TRAIN THE SYSTEM FOR THE BENEFIT OF PROVIDER’S OTHER USERS AND CUSTOMERS; AND (b) INFORMATION GIVEN TO PROVIDER’S OTHER USERS AND CUSTOMERS MAY INCLUDE INFORMATION FROM CUSTOMER’S PROMPTS AND CUSTOMER TRAINING DATA UPLOADED PURSUANT TO THIS AGREEMENT.” In that case, the customer should keep sensitive information out of prompts and training data.
  3. Typical Outputs – Nondisclosure Protecting Customer + Trade Secret Precautions: If the provider can accept nondisclosure terms protecting prompts and customer training data, it can probably accept them for outputs. The usual caveats about information developed independently should protect the provider, as discussed in bullet 12. Among other things, those caveats should protect provider rights to its own information that winds up in outputs, like information from its training data. For the customer, nondisclosure terms offer the chance to establish trade secret rights in new information generated by the AI. And of course, those terms should protect existing customer secrets that find their way into outputs through prompts and training data. But the customer should take an additional precaution, outside the contract, if outputs could include sensitive information. The customer should treat outputs like trade secrets. It should make sure its staff keeps outputs confidential, at least until it can establish which outputs include potential trade secrets.
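    A minimal sketch of the output-protection piece, assuming the parties adapt their standard nondisclosure terms (the section reference is an illustrative placeholder): “Customer’s ‘Confidential Information’ also includes Outputs, except to the extent an Output consists of Provider information from independent sources, per Section __ (Exceptions to Confidentiality).”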
  4. Outputs from Systems that Reuse Prompts and Customer Training Data – Options for Provider: The fact that gen-AI reuses prompts or customer training data doesn’t necessarily rule out nondisclosure terms protecting outputs. The provider still might not use or even see outputs. So nondisclosure could work, again with the usual independent development caveats discussed in bullet 12. But some providers can’t promise nondisclosure. Their systems use outputs as training data. Or they run open-door AI systems (maybe B2C): software that shares incoming and outgoing information.
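    A provider in that position might instead extend the disclaimer from bullet 13 to outputs. A possible sketch, assuming outputs do feed back into training: “CUSTOMER RECOGNIZES AND AGREES THAT OUTPUTS ARE NOT CONFIDENTIAL AND MAY BE USED TO TRAIN THE SYSTEM FOR THE BENEFIT OF PROVIDER’S OTHER USERS AND CUSTOMERS.”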
  5. Provider’s Training Data and Other Inputs – Nondisclosure Protecting Provider: Some software processes sensitive information supplied by its vendor/provider, and AI is no exception. Machine learning outputs could include sensitive information from provider-supplied training data. In that case, the provider needs nondisclosure promises from the customer. The usual caveats about independent development should protect the customer’s rights to preexisting and third-party information, including information it includes in prompts. Again, see bullet 12 above. But nondisclosure still greatly limits the customer’s use of outputs. For some deals, the customer might need detailed terms on permitted use of confidential information. The following, for instance, could work for AI that helps set prices. “Customer may: (a) use Confidential Information in Outputs to set prices for its own purchase or sale of Priced Assets (“Internal Use Transactions”) but not for any transaction that does not directly involve Customer as a buyer or seller; and (b) disclose Confidential Information in Outputs to its contractors assisting Customer with Internal Use Transactions, subject to Section __ (Contractor NDAs).” Of course, customer nondisclosure obligations would complicate customer ownership or exercise of IP rights in outputs (the subject of bullets 6, 9, and 10).
  6. Provider’s Sensitive Information in Models and Other Tech – Nondisclosure Protecting Provider: Use of software and computer systems can give the customer access to the provider’s sensitive information. Again, AI is no exception. If so, the provider needs nondisclosure terms protecting its technology. See Chapter II.H of The Tech Contracts Handbook and the confidentiality terms on this site at II-H: Nondisclosure/Confidentiality. The provider’s “Confidential Information” might then include AI software, models, documentation, user interfaces, and just about any information available when the customer logs in.
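    For instance, a rough sketch of that definition (wording illustrative only, not from the standard terms): “Provider’s ‘Confidential Information’ includes, without limitation, the System’s software, machine learning models, documentation, and user interfaces, as well as any nonpublic information displayed or otherwise made accessible through the System.”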

The third and final post in this series addresses liability related to outputs and training data, including liability for errors in outputs and for IP infringement, misuse of private information, and defamation. It also addresses special issues, like data security and responsible use of artificial intelligence. Click here.


© 2023, 2024 by Tech Contracts Academy, LLC. All rights reserved.
