Questions tagged [vespa]

Questions related to the open big data serving engine Vespa.

Use Stackoverflow for questions related to the open big data serving engine Vespa.

Info: http://vespa.ai

Documentation: http://docs.vespa.ai/

Github: https://github.com/vespa-engine

230 questions
10
votes
1 answer

Is there an easy way to delete a complete Vespa document set?

Playing with Yahoo's vespa.ai, I'm now at a point where I have a search definition with which I am happy, but still, have a bunch of garbage test documents stored. Is there an easy way to delete/purge/drop all of them at once, ala SQL DROP TABLE or…
Shastick
  • 1,218
  • 1
  • 12
  • 29
5
votes
1 answer

Disable caching in vespa

I am doing load test on an application that uses Vespa as a database. I have some sample records for which I am doing the test. Now when I run the load test for the first time, Vespa caches the query result which affects our next test scenario…
Yash Kasat
  • 203
  • 1
  • 5
5
votes
2 answers

View all the documents loaded into vespa

Is there any way to fetch all the documents loaded into vespa? I tried querying with regular expressions, but it didn't work as expected. select * from entity where ID matches "[.]+"; ID is not an attribute, but I tried with an attribute field,…
Raghu Venmarathoor
  • 858
  • 11
  • 28
4
votes
1 answer

Searching non-primitive types within a struct

I need to search within a an array nested in another array. Let's say I have the following document schema foos { document foos { struct foo { field bars type array {} } field baz type string { …
Kyle Rowan
  • 87
  • 1
  • 4
4
votes
1 answer

How to get text matching percentage in vespa?

Is there any functionality where we can search a text and we get the matching percentage that how much the test has been matched or string distance between searched text and result text.
suyash308
  • 347
  • 1
  • 7
4
votes
2 answers

Does search latency increase with the document size?

Does the search latency increase when data keeps on growing in a document type? As we don't directly manage shard type configurations in Vespa, how does it manage it? Is creating multiple document types a good practice for handling scaling…
Harsh Choudhary
  • 475
  • 5
  • 12
4
votes
1 answer

Attribute Comparators in Vespa.ai

Does Vespa support comparators for string matching like Levenshtein, Jaro–Winkler, Soundex etc? Is there any way we can implement them as plugins as some are available in Elasticsearch? What are the approaches to do this type of searches?
Harsh Choudhary
  • 475
  • 5
  • 12
4
votes
2 answers

Query by partition in Vespa.ai

When a query is issued, Vespa runs the query on all the content nodes (in the distribution group) and returns the results. I have two keys that are always present in the search query. Can I partition the data from the values of those keys, so that…
Harsh Choudhary
  • 475
  • 5
  • 12
4
votes
1 answer

In-memory document storage in Vespa.ai

Is there any way to store document data in-memory rather than on disk? This way the performance can be increased because of reduced IO.
Harsh Choudhary
  • 475
  • 5
  • 12
4
votes
0 answers

Casting a JavaCC grammar definition into VSCode syntax highlight?

I'm using the Yahoo's Vespa engine that involves a specific grammar in its configurations files (called SearchDefinition). I have access to the grammar of that configuration language through a JavaCC .jj file (see my issue on Vespa's Github). My…
4
votes
2 answers

Scaling Vespa for 500 QPS for search

We have created a custom searcher. Our document size is around 400 000. Latency remains in less than 100ms but when we are doing load test, it does not give QPS of more than 80, and latency also increases up to 4-5 seconds. We are using 9 node…
Harsh Choudhary
  • 475
  • 5
  • 12
4
votes
1 answer

How to properly use fieldsets in vespa?

I'm seeing a warnings like these when running vespa-deploy prepare command The matching settings for the fields in fieldset 'default' are inconsistent (explicitly or because of field type). This may lead to recall and ranking issues. The…
Raghu Venmarathoor
  • 858
  • 11
  • 28
4
votes
2 answers

How should I customize my search result in vespa?

I have a search result like whcih is given below : { "root": { "id": "toplevel", "relevance": 1, "fields": { "totalCount": 20 }, "coverage": { "coverage": 100, "documents": 20, "full": true, …
Mohammad Sunny
  • 371
  • 1
  • 3
  • 15
4
votes
1 answer

How to configure shards in Vespa?

We want to setup cluster of 4 nodes to host data. And the cluster hosts one index only, so to have similar data type in all 4 node. Our goal is to have data sharded on the nodes. Let say two shards and two replica. (total 4 nodes to host these 4…
enator
  • 2,431
  • 2
  • 28
  • 46
4
votes
2 answers

Vespa: Can we aggregate on nested fields?

In search definition the fields inside struct can not have "attribute" indexing. http://docs.vespa.ai/documentation/reference/search-definitions-reference.html#field_types Also, struct and maps are not attribute by default. Resulting search…
enator
  • 2,431
  • 2
  • 28
  • 46
1
2 3
15 16