Project

General

Profile

Actions

Bug #2154

closed

Metacat Performace: Configurable path condition indices

Added by Saurabh Garg almost 19 years ago. Updated over 18 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
metacat
Target version:
Start date:
07/13/2005
Due date:
% Done:

0%

Estimated time:
Bugzilla-Id:
2154

Description

From Matt's email...

Configurable path condition indices -- this would allow admins to
configure specific XML paths in metacat to be replicated to their own
table in the DB (which would be dynamically created) to make searching
fast for those paths. This is effective because clients (e.g., Morpho,
web) tend to use only a limited set of where clause restrictions (e.g.,
on title, surname, keyword, ...). In a typical EML document with 5000
nodes, our current search has to hit many records when searching for a
path like "dataset/title". Given that each document has only one or a
few titles, a dedicated table for title that is indexed might only have
3500 records in the "dataset/title" table (compared to ~106 in
xml_nodes), which would be a significantly faster query. Also, by
creating temporary tables for all commonly searched fields, we would
possibly avoid searchcing the xml_nodes table altogether, and therefore
avoid the whole recursive query issue. See Sid's point #2 below for
more details. This has a lot of potential for speeding up structured
queries (e.g., spatial), but xml_nodes would still be used for
unstructured (anyfield) queries.

Actions

Also available in: Atom PDF