[RMP] Support Offline Batch processing of Recs Generation Pipelines #419

Open · 4 tasks
jperez999 opened this issue Jun 27, 2022 · 11 comments

@jperez999 (Collaborator) commented Jun 27, 2022

Problem:

As a user, I would like to run my merlin systems inference pipeline in an offline setting. This will allow me to produce a set of recommendations for all users that can be served from a data store, used in an email campaign, etc. I will also be able to conduct rigorous testing and better compare behavior against other systems, at both the operator and the system level.

Goal:

To do this, I need to be able to run my merlin systems inference graph without using Triton or the configs generated for it. This will require a new operator executor class that runs the ops in Python instead of on tritonserver. The execution should behave exactly as it does in the tritonserver setting, meaning each operator should be provided the same inputs and return the same outputs. (A rough sketch of such an executor follows the list below.)

  • Run an inference operator graph without tritonserver.
  • Does not require any new user-facing API changes.
  • Execute the same graph that would be deployed to tritonserver.
  • Execute in a Python process.
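
A minimal sketch of what such an executor could look like. `SimpleNode` and `PythonExecutor` are illustrative stand-ins, not actual merlin systems APIs, and pandas DataFrames stand in for whatever transformable type we settle on:

```python
# Hypothetical sketch: run an operator graph in-process so each operator
# receives the same upstream outputs it would get under tritonserver.
# SimpleNode and PythonExecutor are illustrative, not merlin systems APIs.
import pandas as pd


class SimpleNode:
    def __init__(self, op, parents=None):
        self.op = op                  # any object with transform(df) -> df
        self.parents = parents or []


class PythonExecutor:
    """Execute a graph of operators in a Python process instead of Triton."""

    def transform(self, df: pd.DataFrame, node: SimpleNode) -> pd.DataFrame:
        # Resolve upstream operators first, then concatenate their outputs
        # column-wise so this node sees the same inputs as in the ensemble.
        if node.parents:
            parent_outputs = [self.transform(df, p) for p in node.parents]
            df = pd.concat(parent_outputs, axis=1)
        return node.op.transform(df)
```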

Constraints:

  • Use the same merlin systems graph/ops that were created for the inference pipeline and would run on tritonserver.
  • Swap out the operator executor for a Python (non-Triton) version.
  • Allow for all types of graphs, supporting multiple chains and parallel running of ALL available operators. (A toy demonstration follows this list.)
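
As a toy demonstration of these constraints, reusing the hypothetical `PythonExecutor` sketched above (the operator names here are made up), the exact same graph object runs unchanged once the executor is swapped:

```python
# Toy, hedged demonstration: the same two-operator graph that would be
# exported to tritonserver runs unchanged under the in-process executor.
import pandas as pd


class ScaleOp:
    def transform(self, df):
        return df * 2


class ScoreOp:
    def transform(self, df):
        return df.assign(score=df.sum(axis=1))


features = SimpleNode(ScaleOp())
graph = SimpleNode(ScoreOp(), parents=[features])  # graph is identical either way

recs = PythonExecutor().transform(pd.DataFrame({"x": [1, 2]}), graph)
print(recs)  # -> scaled column x plus a score column
```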

TODO:

Core

Systems

Issues

Example


@viswa-nvidia commented

Assignees will be Karl / Adam.

@sohn21c commented Jul 13, 2022

This is a prerequisite for cross-FW evaluation.

@nv-alaiacano (Contributor) commented

My impression is that batch inference for models is required for cross-FW evaluation, not the full batch inference for a system. The additional steps in the Systems' computation graph (QueryFeast, QueryFaiss, Softmax, filtering, etc) would likely not be required for batch inference on a single Model. Batch inference for the model would have a simpler "training data in -> predictions out" process, which would likely be a step in the Systems graph.

Perhaps we should first build the batch inference functionality (apply nvt transform + use model to predict) including the output format schema, and then that functionality could be shared in cross-FW evaluation and systems-wide batch prediction.
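
A minimal sketch of that shared step, assuming a fitted NVTabular workflow and a `predict_fn` that maps a dataframe batch to predictions; `batch_predict` and `predict_fn` are assumptions for illustration, not existing APIs:

```python
# Sketch of the shared "training data in -> predictions out" step:
# apply the fitted NVTabular workflow, then run the model per partition.
# batch_predict and predict_fn are assumptions, not existing merlin APIs.
from merlin.io import Dataset


def batch_predict(workflow, predict_fn, dataset: Dataset) -> Dataset:
    # Same preprocessing the online pipeline would apply.
    transformed = workflow.transform(dataset)

    # Map the model over each partition of the transformed data.
    predictions_ddf = transformed.to_ddf().map_partitions(predict_fn)
    return Dataset(predictions_ddf)
```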

@karlhigley (Contributor) commented

We do have some batch prediction functionality for models already, but it's not structured in a way that would make it a reasonable foundation for batch processing of graphs. I think we could massage it in that direction, though, and standardize how batch graph processing works in Merlin Core by refactoring what already exists.

@bschifferer (Contributor) commented

@karlhigley do you think we should add an example for it?

karlhigley changed the title from "[RMP] Support Offline Batch processing of Inference Pipelines" to "[RMP] Support Offline Batch processing of Recs Generation Pipelines" on Oct 19, 2022
@karlhigley (Contributor) commented

I think we should add an example for every significant new piece of functionality (i.e., almost all roadmap issues).


@jperez999 (Collaborator, Author) commented

This is not considered done until we can run all systems operators with a dask executor to create recommendations. Currently, some systems operators work with batches of input data, as shown in #1022. We need to make all operators work with batches of incoming data.
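
A hedged sketch of what that could look like, treating each dask partition as one batch and reusing the illustrative `PythonExecutor`/`SimpleNode` classes sketched in the description (none of this is an existing merlin API):

```python
# Illustrative only: push each dask partition (one batch) through the same
# operator graph, so every operator must handle batches of incoming data.
import dask.dataframe as dd


class DaskBatchExecutor:
    def __init__(self, inner=None):
        self.inner = inner or PythonExecutor()

    def transform(self, ddf: dd.DataFrame, graph: SimpleNode) -> dd.DataFrame:
        # map_partitions hands each batch to the in-process executor;
        # dask infers the output schema by applying it to empty metadata.
        return ddf.map_partitions(self.inner.transform, graph)
```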

@karlhigley (Contributor) commented

@jperez999 Could you add appropriate tasks to the list in the description?

@karlhigley (Contributor) commented

(People don't generally scroll down to the latest comments when looking at WIP issues to track progress, so a comment helps, but a description update is better.)

@jperez999 (Collaborator, Author) commented

We need to be able to swap out certain operators based on the runtime. For example, when running the dask executor for offline batch processing, it is not necessary to run the feature store operator unless we are testing against it; you could instead run a dataset merge operator that uses offline features stored in a parquet file. Please refer to the task list created above for further tracking.
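
A hedged sketch of that swap; `ParquetMergeOp`, the `is_feature_store` flag, and the parquet path are all illustrative, building on the toy `SimpleNode` graph from the description:

```python
# Illustrative runtime-based swap: when executing offline, replace a
# feature-store lookup node with a plain merge against parquet features.
import pandas as pd


class ParquetMergeOp:
    """Offline stand-in for a feature store lookup operator."""

    def __init__(self, path: str, on: str = "user_id"):
        self.features = pd.read_parquet(path)
        self.on = on

    def transform(self, df: pd.DataFrame) -> pd.DataFrame:
        # Produce the same feature columns as the online lookup, from disk.
        return df.merge(self.features, on=self.on, how="left")


def swap_for_offline(node: SimpleNode) -> SimpleNode:
    # Walk the graph, substituting feature-store nodes for offline runs.
    node.parents = [swap_for_offline(p) for p in node.parents]
    if getattr(node.op, "is_feature_store", False):  # hypothetical flag
        node.op = ParquetMergeOp("offline_features.parquet")  # hypothetical path
    return node
```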
