Remove null values from an array in a UDF

Darryl_Naidu · July 12, 2024, 2:43pm

I have a function that returns an array of distinct string values though want to figure out how to drop a null value which is inherently included within the returned array. Can anyone help rewrite the UDF below to assist?

The v4 function is below:

Query(
  Lambda(
    [],
    Union(
      Select(
        ["data"],
        Paginate(Distinct(Match(Index("allAllergens"))), { size: 100000 })
      )
    )
  )
)

ptpaterson · July 15, 2024, 3:20pm

Hi @Darryl_Naidu!

There is a few things going on here, so it will be hard to know how to answer. Can you share the index definition, so we can see the shape of results? Please also provide an example document that would match the index.

Some initial considerations:

You’ll want to also port the index over to v10. Seeing the v4 definition we can see how to do that.
Union here looks like you just need that to flatten results. You can use flatmap or flatten if necessary in v10 for that.
you can drop a value from an array by using .where(x => x != null). But it’s not clear at all what the data looks like to start. Maybe there is something we can do with the index, or leverage a computed field, to simplify things.

Darryl_Naidu · July 18, 2024, 3:49pm

Thanks @ptpaterson,

I have actually solved this with a Filter expression which I hope is reasonably efficient. The revision is below:

{
  name: "FindAllAllergens",
  role: null,
  body: Query(
    Lambda(
      [],
      Filter(
        Union(
          Select(
            ["data"],
            Paginate(Distinct(Match(Index("allAllergens"))), { size: 100000 })
          )
        ),
        Lambda("i", IsString(Var("i")))
      )
    )
  )
}

If anyone could help with a v10 conversion for this expression that would be a great help. The v10 equivalent index used is below:

index allAllergens {
    values [.allergens, .potential_allergens]
  }

ptpaterson · July 23, 2024, 3:38pm

What is the allAllergens v4 index definition? It looks like you are only checking if the first value defined in the v4 index is a string. Are you not concerned about the second value? And is it the same order as the values defined in the v10 definition? Just making sure

You might try something like this as a v10 equivalent:

Allergens.allAllergens()
  .fold([], (acc, val) => acc.concat(val.allergens).concat(val.potential_allergens))
  .where(a => a isa String)
  .distinct()

See docs for the isa operator. Operators - Fauna Docs

And this related topic regarding that fold operation:

Darryl_Naidu · August 9, 2024, 4:20pm

Thanks @ptpaterson! This works exactly as intended. Compute ops significantly reduced by re-writing the UDF as:
Allergens.allAllergens() .fold([], (acc, val) => acc.concat(val.allergens).concat(val.potential_allergens)) .distinct() .where(a => a isa String)

Now to find a less hungry alternative for fold

ptpaterson · November 7, 2024, 4:20pm

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Using the Object.values() and flatMap() functions in v10 UDFs Help	6	74	November 7, 2024
How to generate empty Sets? Help fql	5	425	February 12, 2021
First value from index Help fql , udfs , best-practices	3	335	April 15, 2022
UDFs in index bindings / array generation for search over index Help fql , udfs , functions	2	849	May 4, 2021
Help updating an array element Help fql , udfs	4	1041	August 1, 2021

Remove null values from an array in a UDF

Related topics