<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel>
	  <title>Ask MetaFilter questions tagged with datawarehouse</title>
      <link>http://ask.metafilter.com/tags/datawarehouse</link>
      <description>Questions tagged with 'datawarehouse' at Ask MetaFilter.</description>
	  <pubDate>Tue, 21 Apr 2009 14:14:16 -0800</pubDate> <lastBuildDate>Tue, 21 Apr 2009 14:14:16 -0800</lastBuildDate>

      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>	  
	<item>
	<title>Oracle Views: Are calculated columns always evaluated even when they are not considered in a query?</title>
	<link>http://ask.metafilter.com/120110/Oracle%2DViews%2DAre%2Dcalculated%2Dcolumns%2Dalways%2Devaluated%2Deven%2Dwhen%2Dthey%2Dare%2Dnot%2Dconsidered%2Din%2Da%2Dquery</link>	
	<description>Oracle Views: Are calculated columns always evaluated even when they are not considered in a query? I&apos;d like some advice regarding Oracle views. Are fields always evaluated regardless of whether they are required in the results set?&lt;br&gt;
&lt;br&gt;
Let&apos;s say I have tables PRODUCTS and ORDERS. And I create a view called PRODUCTS_VIEW which consists of something like:&lt;br&gt;
&lt;br&gt;
SELECT &lt;br&gt;
	PRODUCT.ID, &lt;br&gt;
	PRODUCT.NAME,&lt;br&gt;
	fcnGetMostRecentOrderDate(PRODUCT.ID) AS MostRecentOrderDate,&lt;br&gt;
	fcnGetTotalOrdersCount(PRODUCT.ID) AS TotalOrdersCount,&lt;br&gt;
	fcnGetMaxOrderVolumn(PRODUCT.ID) AS MaxOrderVolumn&lt;br&gt;
        and so on&lt;br&gt;
FROM&lt;br&gt;
     PRODUCTS,&lt;br&gt;
     ORDERS&lt;br&gt;
&lt;br&gt;
I.e. a flat view that encapsulates all possible attributes--I estimate around 20 static and 60 calculated fields.&lt;br&gt;
&lt;br&gt;
This is for reporting purposes, either queried directly or perhaps wrapped up in an API. Maintainability is important, if I need a new attribute then I can easily add a new field as necessary. Performance is certainly a factor but, since it&apos;s only for reporting purposes, there is a degree of flexibility.&lt;br&gt;
&lt;br&gt;
If I run a query that only considers 2 or 3 fields e.g.&lt;br&gt;
&lt;br&gt;
SELECT PRODUCT.NAME, MostRecentOrderDate&lt;br&gt;
 FROM PRODUCTS_VIEW &lt;br&gt;
WHERE... say....CLIENT = Heinz&lt;br&gt;
&lt;br&gt;
Would it still evaluate *all* the other calculated fields? (I think it does unfortunately: and I&apos;m concerned that performance would degrade as new fields are added.)&lt;br&gt;
&lt;br&gt;
If it *does* behave thus is there a way around it? I suppose a materialized view is an option. Or is this an inappropriate use? I am avoiding dynamic SQL since I would rather consolidate the logic into something that can be made (easily) accessible through ODBC, ADO, a bundle of Web Service methods etc. Note that a data warehouse approach (e.g. a daily dump of denormalised data) is not appropriate since data updates need to reflected immediately.&lt;br&gt;
&lt;br&gt;
Thanks.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2009:site.120110</guid>
	<pubDate>Tue, 21 Apr 2009 14:14:16 -0800</pubDate>
	<category>datawarehouse</category>
	<category>materializedview</category>
	<category>oracle</category>
	<category>resolved</category>
	<category>SQL</category>
	<category>view</category>
	<dc:creator>NailsTheCat</dc:creator>
	</item>
	<item>
	<title>help a layman understand multi-dimensional data modelling</title>
	<link>http://ask.metafilter.com/79482/help%2Da%2Dlayman%2Dunderstand%2Dmultidimensional%2Ddata%2Dmodelling</link>	
	<description>I&apos;d like to know how multi-dimensional data modelling works. My company has Cognos BI and I am working closely with the developers. I would like to understand how a star schema works, fact and dimension tables, etc. I&apos;d like to be able to more clearly envision whats going on with the data. How is it different from relational databases? Can you recommend a book or some online articles written for the layman? &lt;br&gt;
&lt;br&gt;
I don&apos;t necessarily need to understand how *to* model the data, just how to understand data retrieval *from* a model.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.79482</guid>
	<pubDate>Wed, 26 Dec 2007 12:02:49 -0800</pubDate>
	<category>cognos</category>
	<category>database</category>
	<category>datamart</category>
	<category>datamining</category>
	<category>datamodelling</category>
	<category>datawarehouse</category>
	<category>starschema</category>
	<dc:creator>goethean</dc:creator>
	</item>
	<item>
	<title>Datawarehouse 101</title>
	<link>http://ask.metafilter.com/70264/Datawarehouse%2D101</link>	
	<description>I am changing jobs in a couple of months, and need to know more about database design, and more specifically datawarehouse and datamart for reporting systems. I will be working on the ETL and reporting side, but would like to sound more knowledgeable about general database design techniques, such as normalisation, star schemas and so on, with particular emphasis on datawarehousing. The last time i looked at this Ralph Kimball was the datawarehouse reference point. Would it still be worth reading his Lifecycle Toolkit? Or is there somebody new out there that is a must-read? Online or hard-copy recommendations would be most welcome.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.70264</guid>
	<pubDate>Tue, 28 Aug 2007 02:22:50 -0800</pubDate>
	<category>database</category>
	<category>datamart</category>
	<category>datawarehouse</category>
	<category>design</category>
	<dc:creator>jontyjago</dc:creator>
	</item>
	<item>
	<title>Datamining the public web</title>
	<link>http://ask.metafilter.com/68120/Datamining%2Dthe%2Dpublic%2Dweb</link>	
	<description>How do i build a data warehouse that scrapes data from public websites for my own use? Tools? Tips? Hi. I would like to track apartments on a classifieds site and use the data for analyzing the inpact of diffrent things on price. What i need is a tool or scripting language that would make it easy for me to spider the website and put the data in a database. Preferable this would be an open source solution. &lt;br&gt;
&lt;br&gt;
I am also looking for good tools for extracting information out of longer pieces of text. For example on the site i want to mine users can put in comments on every object. I would like to be able to decide if a comment is positive, negative och neither. I have seen this be done on one online art site that i cant remember the name of right now. The artist used blog post and decided the mood of the writer by what words were used.</description>
	<guid isPermaLink="false">tag:ask.metafilter.com,2007:site.68120</guid>
	<pubDate>Mon, 30 Jul 2007 01:52:51 -0800</pubDate>
	<category>datamining</category>
	<category>datawarehouse</category>
	<category>scraping</category>
	<category>spidering</category>
	<dc:creator>ilike</dc:creator>
	</item>
	
	</channel>
</rss>

