Bug #2159: Metacat Performance: Divide xml_nodes based on the doctype - Metacat - Ecoinformatics Redmine

Actions

Copy link

Bug #2159

open

Metacat Performance: Divide xml_nodes based on the doctype

Added by Saurabh Garg over 19 years ago. Updated over 17 years ago.

Status:

New

Priority:

Normal

Assignee:

Jing Tao

Category:

metacat

Target version:

2.x.y

Start date:

07/13/2005

Due date:

% Done:

Estimated time:

Bugzilla-Id:

2159

Description

Matt suggested that xml_nodes table can probably be divided into multiple
tables based on the doctype. For example, xml_nodes can be divided into the
following:
1. xml_nodes_eml_2_0_0
2. xml_nodes_eml_2_0_1
3. xml_nodes_default

Hence any query for searching for a given text will be divided into 3 sub
queries, results of which can be unioned. While we are still going through the
same number of records (assuming the query is for all doctypes), this might
result in performance enhancement in a db like Oracle on a multi-proc machine.
From experience, Postgres will still run it on one proc... unless we run a
seperate query on each of the tables and union the results in the servlet

Though it is not clear how much performance would be achieved by this...

Actions

Copy link

Updated by Saurabh Garg about 18 years ago

This approach might be worth considering but I think it will only give short term benefits if at all any benefits. This is because most of the searched we have request both eml-2.0.0 and eml-2.0.1 as return doctypes. And these are the major doctypes that we hold right now. hence I dont think this approach will be helpful in any way.

Actions

Copy link

Updated by Redmine Admin over 11 years ago

Original Bugzilla ID was 2159

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Metacat

Custom queries

Bug #2159

Metacat Performance: Divide xml_nodes based on the doctype

Updated by Saurabh Garg about 18 years ago

Updated by Redmine Admin over 11 years ago