<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>MARC Content Designation Utilization</title>
	<atom:link href="http://www.mcdu.unt.edu/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://www.mcdu.unt.edu</link>
	<description>Inquiry and Analysis</description>
	<pubDate>Thu, 18 Dec 2008 16:29:23 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Identifying and Understanding Factors Affecting Catalogersâ€™ Utilization of MARC</title>
		<link>http://www.mcdu.unt.edu/?p=48</link>
		<comments>http://www.mcdu.unt.edu/?p=48#comments</comments>
		<pubDate>Tue, 01 Jan 2008 21:19:16 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=48</guid>
		<description><![CDATA[The MCDU Project investigated catalogersâ€™ use of MARC fields, subfields, and other content designation structures (CDS) in over 56 million MARC bibliographic records from OCLCâ€™s WorldCat database. As we confirmed through our analysis that a relatively small percentage of available CDS are typically used, the recurring question was: Why do catalogers do what they do? [...]]]></description>
			<content:encoded><![CDATA[<p>The MCDU Project investigated catalogersâ€™ use of MARC fields, subfields, and other content designation structures (CDS) in over 56 million MARC bibliographic records from OCLCâ€™s WorldCat database. As we confirmed through our analysis that a relatively small percentage of available CDS are typically used, the recurring question was: Why do catalogers do what they do? The MCDU Project did not study this question but the Project provide a wealth of empirical data about the records created in the cataloging enterprise. The results and findings about catalogersâ€™ use of MARC CDS can be further informed by studying the factors that affect catalogersâ€™ utilization of MARC.</p>
<p>In our original proposal to the Institute of Museum and Library Services for the MCDU Project, we included as one of the project goals: <i>Investigate a methodological approach to understand the factors contributing to current levels of MARC content designation use and relationships with the cataloging enterprise</i>. A specific objective of the MCDU Project was: <i>Develop a methodological approach to identify and understand factors contributing to catalogersâ€™ use of MARC content designation</i>.  We have addressed this objective by producing the following document:</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=48</wfw:commentRss>
		</item>
		<item>
		<title>MARC Content Designation Use over Time</title>
		<link>http://www.mcdu.unt.edu/?p=47</link>
		<comments>http://www.mcdu.unt.edu/?p=47#comments</comments>
		<pubDate>Tue, 01 Jan 2008 21:13:03 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=47</guid>
		<description><![CDATA[The MCDU Project focused on analyzing catalogerâ€™s use of MARC fields, subfields, and other content designation structures (CDS). The first phase of that analysis examined the use of MARC CDS based on format of material and source of cataloging. We know, however, that over time the CDS available to catalogers changed as MARC evolved. New [...]]]></description>
			<content:encoded><![CDATA[<p>The MCDU Project focused on analyzing catalogerâ€™s use of MARC fields, subfields, and other content designation structures (CDS). The first phase of that analysis examined the use of MARC CDS based on format of material and source of cataloging. We know, however, that over time the CDS available to catalogers changed as MARC evolved. New CDS were added, some were deleted, and format integration brought together similar CDS from various formats. </p>
<p>The MCDU Project Team carried out a second phase of analysis that attempted to take into account the dynamic and evolving nature of MARC and the CDS available to catalogers over time. A document listed below describes the methodology for this analysis. In addition, the frequency counts and tabulation upon which this summary is based are available in a set of data reports listed below.</p>
<p>Before this analysis could be carried out, it was first necessary to document when new fields, subfields, and other CDS were added, deleted, and/or changed. No existing database or resource existed that documented the evolution of MARC CDS. The MCDU Project created <b>HistoriMARC</b>, a database that stores in structured form all available information about MARC bibliographic CDS from 1972 through 2004. HistoriMARC was the enabling tool for carrying out the analysis of MARC use over time since it allowed us to manipulate and reuse data about CDS additions, deletions, and other changes in our analysis. A summary of the results and findings is provided in the document, <b><i>Catalogers&#8217; Use of MARC Content Designation over Time: An Analysis of MARC Records from 1972 to 2004</b></i>.</p>
<p><a href="http://www.mcdu.unt.edu/wp-content/CatalogersUseOverTime_Final_30Dec2007.pdf">Catalogers&#8217; Use of MARC Content Designation over Time: An Analysis of MARC Records from 1972 to 2004</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=47</wfw:commentRss>
		</item>
		<item>
		<title>Extent of Support for FRBR User Tasks</title>
		<link>http://www.mcdu.unt.edu/?p=46</link>
		<comments>http://www.mcdu.unt.edu/?p=46#comments</comments>
		<pubDate>Tue, 19 Dec 2006 21:28:57 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=46</guid>
		<description><![CDATA[The frequency count analysis and the identification of commonly used elements provide the basis for addressing another project objective, namely, examining the extent to which the four user tasks identified in the Functional Requirements for Bibliographic Records (FRBR) are supported by catalogers&#8217; utilization of MARC elements in the MCDU dataset.  
We first developed a [...]]]></description>
			<content:encoded><![CDATA[<p>The frequency count analysis and the identification of commonly used elements provide the basis for addressing another project objective, namely, examining the extent to which the four user tasks identified in the Functional Requirements for Bibliographic Records (FRBR) are supported by catalogers&#8217; utilization of MARC elements in the MCDU dataset.  </p>
<p>We first developed a methodology for comparing the MCDU results with the data elements of the MARC record shown by Delsey’s research to support the four resource discovery tasks: find, identify, select, and obtain. We then carried out the analysis and developed a report of our findings. These documents, along with a spreadsheet used for analyzing the frequency data are listed below. The documents are still in draft form, as we continue working on this project activity. </p>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FRBRMappingMethodologyglsd130Jan2006.pdf">A Methodology for Mapping Frequently Occurring MARC Content Designation to FRBR Resource Discovery Tasks</a>[draft document]</li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FRBRCDSallSetswem30Nov2006.xls">Spreadsheet for FRBR Analysis (combining frequency count data and Delsey&#8217;s mapping of MARC Elements to FRBR user tasks</a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FRBRAnalysisglsd105May2006.pdf">FRBR Analysis</a>[draft document]</li>
</ul>
<p>Back to <a href="http://www.mcdu.unt.edu/?p=41">Analysis Reports and Results</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=46</wfw:commentRss>
		</item>
		<item>
		<title>Commonly Used MARC Elements</title>
		<link>http://www.mcdu.unt.edu/?p=45</link>
		<comments>http://www.mcdu.unt.edu/?p=45#comments</comments>
		<pubDate>Tue, 19 Dec 2006 21:28:32 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=45</guid>
		<description><![CDATA[One of the research goals of the project, to provide empirical evidence to document MARC21 content designation use by catalogers, has been partially met by the objective of conducting frequency counts of all fields and subfields used in the OCLC WorldCat database. Another research objective is to identify commonly used elements in bibliographic records based [...]]]></description>
			<content:encoded><![CDATA[<p>One of the research goals of the project, to provide empirical evidence to document MARC21 content designation use by catalogers, has been partially met by the objective of conducting frequency counts of all fields and subfields used in the OCLC WorldCat database. Another research objective is to identify commonly used elements in bibliographic records based on the analysis of format-specific record sets and comparing these elements with existing recommendations by Library of Congress agencies for national level, core level, and minimal level records. </p>
<p>In support of the research goals, we carried out a set of analyses to address the following research questions: What are the sets of commonly used elements per format, and how do these compare with the elements prescribed in current national, core, and minimal level recommendations or guidelines for cataloging? Conversely, are there elements which are frequently used by catalogers but are not prescribed in current national, core, and minimal level recommendations or guidelines for cataloging? </p>
<p>We first analyzed and compared the utilization of MARC content designation in the MCDU dataset with two bibliographic record standards, <a href="http://www.loc.gov/catdir/pcc/bibco/coreintro.html">Program for Cooperative Cataloging (PCC) BIBCO Core Record Standards</a>, and <a href="http://www.loc.gov/acq/conser/recordreq.html">CONSER Record Standards</a>. We then analyzed and compared the utilization with <a href ="http://www.loc.gov/marc/bibliographic/nlr/">National and Minimal Level Requirements for Bibliographic Records</a>.  The methodology and reports of the analysis are linked below. In addition, we provide the spreadsheet file used in the comparison of MCDU utilization with the National and Minimal Level Requirements for Bibliographic Records. The first worksheet in the file gives instructions on use of the spreadsheet and its embedded filters.</p>
<p>Finally, we identified the most commonly occurring elements, or those that occur within a defined threshold, in the MCDU dataset. These commonly occurring elements are given context by comparing them with the National and Minimal Level Bibliographic Record Requirements, and the Program for Cooperative Cataloging’s (PCC) BIBCO Core Record Standards and CONSER Record Requirements for Full, Minimal and Core Level Records for Serials. </p>
<p>Through this comparison of recommendations of elements with actual usage of elements by catalogers, we hope the results can illuminate the intersection of standards and practice in cataloging, and to inform future development of standards.</p>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/PCCMappingMethodologyae4April2006.pdf" title="Methodology for Mapping Frequently Occurring MARC Content Designation to PCC and CONSER Core Record Standards ">Methodology for Mapping Frequently Occurring MARC Content Designation to PCC and CONSER Core Record Standards</a> [draft document]</li>
<li><a href="http://www.mcdu.unt.edu/wp-content/NatMinCDSOccurThrallSetsae10Apr2006.xls" title="Spreadsheet for Analysis of National and Minimal Level Requirements">Spreadsheet for Analysis of National and Minimal Level Requirements</a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/PCCompared1ae3Apr2006.pdf" title="Comparison of MARC Content Designation Utilization with PCC BIBCO and CONSER Standards">Comparison of MARC Content Designation Utilization with PCC BIBCO and CONSER Standards</a> [draft document]</li>
<li><a href="http://www.mcdu.unt.edu/wp-content/SummaryResultsNatMinLevelae26April2006.pdf" title="National and Minimal Level Requirements: Summary of Analysis Results">National and Minimal Level Requirements: Summary of Analysis Results</a> [draft document]</li>
<li><a href="http://www.mcdu.unt.edu/wp-content/CoreElementsAnalysisae3May2006.pdf" title="Preliminary Analysis of Commonly Occurring Elements in MARC21 Records from OCLC WorldCat">Preliminary Analysis of Commonly Occurring Elements in MARC21 Records from OCLC WorldCat</a> [draft document]</li>
</ul>
<p>Back to <a href="http://www.mcdu.unt.edu/?p=41">Analysis Reports and Results</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=45</wfw:commentRss>
		</item>
		<item>
		<title>Results of Frequency Counts Analysis</title>
		<link>http://www.mcdu.unt.edu/?p=43</link>
		<comments>http://www.mcdu.unt.edu/?p=43#comments</comments>
		<pubDate>Mon, 18 Dec 2006 16:19:35 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=43</guid>
		<description><![CDATA[The first phase of our analyses focused on determining the frequency of occurrences of all the MARC content designation structures. We profiled the entire dataset, and the results are listed in the first document below. 
The entire dataset was separated into 20 databases based on format of material described (e.g., electronic resources,  sound recordings, [...]]]></description>
			<content:encoded><![CDATA[<p>The first phase of our analyses focused on determining the frequency of occurrences of all the MARC content designation structures. We profiled the entire dataset, and the results are listed in the first document below. </p>
<p>The entire dataset was separated into 20 databases based on format of material described (e.g., electronic resources,  sound recordings, and graphic materials)  and the original cataloging source (either Library of Congress or OCLC member libraries). We then carried out frequency count analysis on each of the twenty subsets of the dataset. The links below connect to the resulting 20 data reports.  The first link is to a spreadsheet that contains much of the data used in the analysis. The spreadsheet has embedded filters that allow various views of the data, including the content designation in the various subsets of records that are within the calculated threshold we use to determine commonly occurring elements.  </p>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FreqCountsCDSOccurThrallSetswem20Dec2006.xls" title="Spreadsheet with Frequency Count (for all MARC elements across the entire dataset and for subsets)">Spreadsheet with Frequency Count (for all MARC elements across the entire dataset and for subsets)</a></li>
</ul>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRGenProfileswemFinal20Dec2005.pdf" title="Format Content Designation Analysis: Data Report—General Profiles">Format Content Designation Analysis: Data Report—General Profiles  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/> </a>
<p>This document provides an overall picture of the MCDU dataset of 56,177,383 MARC 21 records. The profiles of the entire set of records as well as some of the subsets of records (based on either source of cataloging or format of material described) offer a view of some key characteristics such as length of records, date of records, type of records, etc.</p>
</ul>
<h4><u>Frequency Count Results for Records Created by the Library of Congress</u></h4>
<p>The dataset contains 8,713,665	records (approximately 15.5% of the total MCDU dataset) that can be attributed to catalogers at the Library of Congress or cooperative cataloging projects supervised by the Library of Congress. The following reports present the results of frequency count analyses on these sets of MARC records distinguished by the type of material being described. These are final versions of these data reports (March 2006).</p>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount01BLCgls20Mar2006.pdf" title="Frequency Counts for Books, Pamphlets, and Printed Sheets (Set 01_B_LC)"> Frequency Counts for Books, Pamphlets, and Printed Sheets (Set 01_B_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/> </a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount02CMLCgls20Mar2006.pdf" title="Frequency Counts for Cartographic Materials Records (Set 02_CM_LC)"> Frequency Counts for Cartographic Materials Records (Set 02_CM_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/> </a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount03ERLCgls20Mar2006.pdf" title="Frequency Counts for Electronic Resources Records (Set 03_ER_LC)"> Frequency Counts for Electronic Resources Records (Set 03_ER_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/> </a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount04CRLCgls20Mar2006.pdf" title="Frequency Counts for Continuing Resources Records (Set 04_CR_LC)" />Frequency Counts for Continuing Resources Records (Set 04_CR_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount05MSLCgls20Mar2006.pdf" title="Frequency Counts for Manuscripts (including Manuscript Collections) (Set 05_MS_LC)" />Frequency Counts for Manuscripts (including Collections) (Set 05_MS_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount06MULCgls20Mar2006.pdf" title=" Frequency Counts for Music Records (Set 06_MU_LC)" /> Frequency Counts for Music Records (Set 06_MU_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount07SRLCgls20Mar2006.pdf" title="Frequency Counts for Sound Recording Records (Set 07_SR_LC)" />Frequency Counts for Sound Recording Records (Set 07_SR_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount08PMLCgls20Mar2006.pdf" title="Frequency Counts for Projected Media Records (Set 08_PM_LC)" />Frequency Counts for Projected Media Records (Set 08_PM_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount09GMLCgls20Mar2006.pdf" title="Frequency Counts for Graphic Materials Records (Set 09_GM_LC)" />Frequency Counts for Graphic Materials Records (Set 09_GM_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount103DLCgls20Mar2006.pdf" title="Frequency Counts for Three Dimensional Objects and Realia (Set 10_3D_LC)" />Frequency Counts for Three Dimensional Objects and Realia (Set 10_3D_LC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
</ul>
<h4><u>Frequency Count Results for Records Created by the OCLC Member Libraries </u></h4>
<p>The dataset contains 47,463,718 records (approximately  84.5% of the total MCDU dataset) that can be attributed to catalogers at OCLC member libraries. The following reports present the results of frequency count analyses on these sets of MARC records distinguished by the type of material being described. These are final versions of these data reports (March 2006).</p>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount01BnonLCae20Mar2006.pdf" title=" Frequency Counts for Books, Pamphlets, and Printed Sheets (Set 01_B_nonLC)" /> Frequency Counts for Books, Pamphlets, and Printed Sheets (Set 01_B_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount02CMnonLCae20Mar2006.pdf" title="Frequency Counts for Cartographic Materials Records (Set 02_CM_nonLC)" />Frequency Counts for Cartographic Materials Records (Set 02_CM_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount03ERnonLCae20Mar2006.pdf" title="Frequency Counts for Electronic Resources Records (Set 03_ER_nonLC)" />Frequency Counts for Electronic Resources Records (Set 03_ER_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount04CRnonLCae20Mar2006.pdf" title="Frequency Counts for Continuing Resources Records (Set 04_CR_nonLC)" />Frequency Counts for Continuing Resources Records (Set 04_CR_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount05MSnonLCae20Mar2006.pdf" title="Frequency Counts for Manuscripts (including Collections) (Set 05_MS_nonLC)" />Frequency Counts for Manuscripts (including Collections) (Set 05_MS_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount06MUnonLCae20Mar2006.pdf" title="Frequency Counts for Music Records (Set 06_MU_nonLC)" />Frequency Counts for Music Records (Set 06_MU_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount07SRnonLCae20Mar2006.pdf" title=" Frequency Counts for Sound Recordings (Set 07_SR_nonLC)" /> Frequency Counts for Sound Recordings (Set 07_SR_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount08PMnonLCae20Mar2006.pdf" title="Frequency Counts for Projected Media (Set 08_PM_nonLC)" />Frequency Counts for Projected Media (Set 08_PM_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount09GMnonLCae20Mar2006.pdf" title="Frequency Counts for Graphic Materials (Set 09_GM_nonLC)" />Frequency Counts for Graphic Materials (Set 09_GM_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANDRFreqCount103DnonLCae20Mar2006.pdf" title="Frequency Counts for Three Dimensional Objects and Realia (Set 10_3D_nonLC)" />Frequency Counts for Three Dimensional Objects and Realia (Set 10_3D_nonLC)  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/></a></li>
</ul>
<p>Back to <a href="http://www.mcdu.unt.edu/?p=41">Analysis Reports and Results</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=43</wfw:commentRss>
		</item>
		<item>
		<title>Preparation of the Data for Analyses</title>
		<link>http://www.mcdu.unt.edu/?p=42</link>
		<comments>http://www.mcdu.unt.edu/?p=42#comments</comments>
		<pubDate>Mon, 18 Dec 2006 12:35:43 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=42</guid>
		<description><![CDATA[The MCDU Project Team&#8217;s first challenge was to prepare the 56 million MARC records for analysis.  This page has links to documents that provide information about the decomposition of the MARC records, the design of a MySQL database to hold the decomposed records, the database loading, and the validation of the parsing software that [...]]]></description>
			<content:encoded><![CDATA[<p>The MCDU Project Team&#8217;s first challenge was to prepare the 56 million MARC records for analysis.  This page has links to documents that provide information about the decomposition of the MARC records, the design of a MySQL database to hold the decomposed records, the database loading, and the validation of the parsing software that decomposed the records. It also includes a document that details the procedures for creating different subsets of the records for analysis by format. The final document presents the analytical questions that guided our analysis. </p>
<h4><u>Decomposing the Records, Database Structure, Database Loading, Validation Procedures</u></h4>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/DecompSpecificationswem11Jul2005.pdf" title="MCDU Project MARC Records Dataset: Decomposition Specification, Database Design, and Parser Software" /><img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/>  MCDU Project MARC Records Dataset: Decomposition Specification, Database Design, and Parser Software</a><br />
This document provides information about the MARC dataset, the specifications for decomposing the MARC record, the design of the database to hold the decomposed records, and the parsing software that was designed to decompose the records and load the data to the database.  Date Posted: July 15,  2005.<br /><br/></li>
<li><a href="http://www.mcdu.unt.edu/wp-content/ParserValidationProceduresFinalwem11July2005.pdf" title="Validation Procedures for MARC Record Parsing Software" /><img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/>  Validation Procedures for MARC Record Parsing Software</a><br />
This document describes the procedures for testing the parsing scripts used to decompose the MARC records. A sample of the raw MARC records from the dataset and the resulting parsed records are subjected to the validation procedures detailed below to verify the integrity of the software and ascertain that the data from the MARC records are correctly represented prior to loading into the database.  Date Posted: July 15,  2005.<br /><br/></li>
</ul>
<h4><u>The MCDU MARC Parser and Database Loader</u></h4>
<ul>
<li>The MCDU Project Team developed a custom application using the PHP scripting language to decompose the MARC records and to load the data into the MySQL database. The parsing software reads MARC records from the ASCII text file, parses the records in memory according to specifications described in the first document listed above, and inserts the records in to MySQL database.  The following is a brief description of the software functionality.  A user selects the source file containing MARC records, the number of records to skip at the beginning of the file, the number of records to process in a group, and the number of records to process from the file. The user then selects the output destination, which can be either Browser or Database. Choosing Browser displays the results of the parsing in the browser user interface. Browser output is used to validate the parsing function. If Database is chosen for the output, the system launches the database loader and moves the parsed data into the MySQL database. After a user clicks the Start Processing button, the parser performs processing, and the results are output to the browser or inserted to the database. For demonstration of this tool, we have disabled writing the data to the database. Click on the link below to see a demonstration of the parser tool. Date Posted: July 15,  2005.</li>
<p><a href="http://metamarc.lis.unt.edu/mcdu_p30.php"> Access the MCDU MARC Parser</a></p>
</ul>
<h4><u>Preparation of the Databases Containing MARC Records for Analyses</u></h4>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANSetDefintionExtractionQuerieswemFinal20Dec2005.pdf" title="Format Content Designation Analysis: Set Definition and Extraction Queries">  Format Content Designation Analysis: Set Definition and Extraction Queries  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/> </a></li>
<p>This document contains the procedures in the form of structured query language (SQL) queries designed to create the 20 format-specific MCDU project databases containing approximately 56 million MARC records from the OCLC WorldCat database. Natural language queries were developed and translated into SQL. Test queries were initially run against samples of the data and analyzed to ensure that the queries are properly formed and produce expected results.</p>
</ul>
<ul>
<li><a href="http://www.mcdu.unt.edu/wp-content/FANSetProfilingAnalysisQuerieswemFinal20Dec05.pdf" title="Format Content Designation Analysis: Set Profiling and Analysis Queries">Format Content Designation Analysis: Set Profiling and Analysis Queries  <img src="/wp-content/pdf.gif" alt=".pdf icon" border="0"/>  </a> </li>
<p>This document describes the questions we are asking of the data to address project research questions. Questions are transformed into SQL queries that result in reports produced by the MySQL database management system. The basic questions asked of the sets are similar, and the organization for basic analysis queries of the MCDU record sets are detailed in this document. The queries are organized into three categories: “General Profile Queries”, “Frequency Counts Queries”, and “Second Level Analysis Queries”. </p>
</ul>
<p>Back to <a href="http://www.mcdu.unt.edu/?p=41">Analysis Reports and Results</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=42</wfw:commentRss>
		</item>
		<item>
		<title>Analysis Reports &#038; Results</title>
		<link>http://www.mcdu.unt.edu/?p=41</link>
		<comments>http://www.mcdu.unt.edu/?p=41#comments</comments>
		<pubDate>Sat, 16 Dec 2006 00:57:47 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=41</guid>
		<description><![CDATA[Throughout the MCDU Project. we have publish on this site reports and other documents from the various analyses completed. This page links to documents that contain results of our analysis and other project deliverables. During the project we addressed a variety of project objectives and activities. We have organized documents and reports into several categories [...]]]></description>
			<content:encoded><![CDATA[<p>Throughout the MCDU Project. we have publish on this site reports and other documents from the various analyses completed. This page links to documents that contain results of our analysis and other project deliverables. During the project we addressed a variety of project objectives and activities. We have organized documents and reports into several categories (follow the links to find the documents and reports associated with these categories):</p>
<p><a href="http://www.mcdu.unt.edu/?p=42">Preparation of the Data for Analyses</a><br />
<a href="http://www.mcdu.unt.edu/?p=43">Results of Frequency Count Analyses</a><br />
<a href="http://www.mcdu.unt.edu/?p=45">Commonly Used MARC Elements</a><br />
<a href="http://www.mcdu.unt.edu/?p=46">Extent of Support for FRBR User Tasks</a><br />
<a href="http://www.mcdu.unt.edu/?p=47">MARC Content Designation Use over Time</a><br />
<a href="http://www.mcdu.unt.edu/?p=48">Identifying and Understanding Factors Affecting Catalogers&#8217; Utilization of MARC</a></p>
<p><em>Preparation of the Data for Analyses</em> addresses the various methods and procedures used in preparing the MCDU dataset of more than 56 million MARC bibliographic records for analyses.</p>
<p><em>Results of Frequency Count Analyses</em> lists the reports that contain the fundamental data upon which subsequent analyses were conducted. One report provides an overview profile of the entire MCDU dataset, and 20 separate reports present the frequency count on subsets of the entire dataset based on format of material described and the source of cataloging.</p>
<p><em>Commonly Used MARC Elements</em> reports on our analysis that determined commonly used elements (based on the frequency count analyses) for each of the ten formats. We provide context for these commonly occurring elements by comparing the elements with guidelines and recommendations fromthe Program on Cooperative Cataloging BIBCO core records, CONSER records, and also recommendations published by the Library of Congress for national level (full and minimal) records. </p>
<p><em>Extent of Support for FRBR User Tasks</em> examined the extent to which MARC elements used in the MCDU dataset support the four user tasks identified in the Functional Requirements for Bibliographic Records. We use a mapping by Tom Delsey of MARC elements that support each task, along with our frequency count analyses as the basis for the results reported.</p>
<p><em>MARC Content Designation Use over Time</em> reports on a second phase of analysis that attempted to take into account the dynamic and evolving nature of MARC and the CDS available to catalogers over time. We know that the CDS available to catalogers changed as MARC evolved. New CDS were added, some were deleted, and format integration brought together similar CDS from various formats.</p>
<p><em>Identifying and Understanding Factors Affecting Catalogers&#8217; Utilization of MARC</em> presents the final deliverable for the MCDU Project. We have developed a research plan that addresses the stakeholders involved in the cataloging enterprise to gain an understanding of the factors that influence and shape catalogers&#8217; decisions regarding the use/nonuse of available MARC CDS. The MCDU Project only looked at the artifacts of the cataloging enterprise (i.e., the actual MARC records). An important next step in a future project will be to engage with the people involved in the cataloging enterprise to understand the behaviors, technologies, policies, and other factors that produced the MARC records examined in the current project.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=41</wfw:commentRss>
		</item>
		<item>
		<title>Published Papers and Presentations</title>
		<link>http://www.mcdu.unt.edu/?p=40</link>
		<comments>http://www.mcdu.unt.edu/?p=40#comments</comments>
		<pubDate>Fri, 15 Dec 2006 15:26:07 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=40</guid>
		<description><![CDATA[The project team is committed to disseminating information about the MCDU Project. This page has links to presentations and published papers. 
Presentations

October 18, 2007. Georgia Council of Media Organizations (GaCOMO) Conference, Jekyll Island, GA. Presentation by Amy Eklund.  Comparing Cataloging Practices: An Analysis of MARC Use by Georgia Catalogers (Presentation) Comparing Cataloging Practices: An [...]]]></description>
			<content:encoded><![CDATA[<p>The project team is committed to disseminating information about the MCDU Project. This page has links to presentations and published papers. </p>
<h4><u>Presentations</u></h4>
<ul>
<li>October 18, 2007. Georgia Council of Media Organizations (GaCOMO) Conference, Jekyll Island, GA. Presentation by Amy Eklund.  <em><a href="http://www.mcdu.unt.edu/wp-content/GeorgiaLibAssocPresentationaeOct2007.pdf" title="Comparing Cataloging Practices: An Analysis of MARC Use by Georgia Catalogers">Comparing Cataloging Practices: An Analysis of MARC Use by Georgia Catalogers (Presentation)</a></em><em><a href="http://www.mcdu.unt.edu/wp-content/GeorgiaLibAssocHandoutaeOct2007.pdf" title="Comparing Cataloging Practices: An Analysis of MARC Use by Georgia Catalogers"> Comparing Cataloging Practices: An Analysis of MARC Use by Georgia Catalogers (Handout)</a></em></li>
<p>.</p>
<li>June 23, 2007. American Library Association Program, Sponsored by ALCTS and MARBI, Washington, DC.<em><a href="http://www.mcdu.unt.edu/wp-content/ALA2007Moen22June2007.pdf" title="Informing the Future of MARC: An Empirical Approach">Informing the Future of MARC: An Empirical Approach (Presentation by Moen and  Miksa)</a></em><em><a href="http://www.mcdu.unt.edu/wp-content/ALA2007ProgramHandout22June2007.pdf" title="Informing the Future of MARC: An Empirical Approach">Informing the Future of MARC: An Empirical Approach (Handout)</a></em><em><a href="http://www.mcdu.unt.edu/wp-content/ALA2007McCallumMARCFutures.pdf" title="MARC Futures">MARC Futures (Presentation by Sally H. McCallum, Library of Congress)</a></em></li>
<p>.</p>
<li>June 21, 2007. National Library of Medicine, Technical Services Division, Bethesda, MD. <em><a href="http://www.mcdu.unt.edu/wp-content/NLM2007Moen21June2007.pdf" title="Informing the Future of MARC: An Empirical Approach">Informing the Future of MARC: An Empirical Approach (Presentation)</a></em><em><a href="http://www.mcdu.unt.edu/wp-content/NLMMARCAnalysisHandoutwem19June2007.pdf" title="Informing the Future of MARC: An Empirical Approach">Informing the Future of MARC: An Empirical Approach (Handout)</a></em></li>
<p>.</p>
<li>August 17, 2006. Nevada Library Association Annual Conference, Las Vegas, NV. <em><a href="http://www.mcdu.unt.edu/wp-content/NVLA2006Moen18Aug2006.pdf" title="21st Century Catalogingï¿½Catalogersï¿½ Use of MARCï¿½Challenges and Opportunities">21stCentury Catalogingï¿½Catalogersï¿½ Use of MARCï¿½Challenges and Opportunities</a></em></li>
<p>. </p>
<li>August 8, 2006. Research Libraries Group User Forum, Washington, DC. <em><a href="http://www.mcdu.unt.edu/wp-content/RLGForum2006Moen6Aug2006.pdf" title="Catalogersï¿½ Use of MARC: Learning from Artifacts through Metadata Utilization Analysis">Catalogersï¿½ Use of MARC: Learning from Artifacts through Metadata Utilization Analysis</a></em></li>
<p>. </p>
<li>June 14, 2006. Joint Conference on Digital Libraries 2006, Chapel Hill, NC. <em><a href="http://www.mcdu.unt.edu/wp-content/JCDL2006Moen14JunRevised.pdf" title="Learning from Artifacts: Metadata Utilization Analysis">Learning from Artifacts: Metadata Utilization Analysis</a></em></li>
<p>. </p>
<li>May 3, 2006. Montana Library Association, Preconference on Catalogersï¿½ Use of MARC and 21st Century Cataloging: Challenges and Opportunities, Missoula MT. <em><a href="http://www.mcdu.unt.edu/wp-content/MARCUtilization.pdf" title="Catalogersï¿½ Use of MARC -- The MCDU Project:An Empirical Analysis of Metadata Utilization by Library Catalogers">Catalogersï¿½ Use of MARC &#8212; The MCDU Project:An Empirical Analysis of Metadata Utilization by Library Catalogers</a></em></li>
<p>. </p>
<li>April 13, 2006. College of Information Science and Technology, Drexel University, Philadelphia, PA. <em><a href="http://www.mcdu.unt.edu/wp-content/MCDUProjectDrexelwem13Apr06.pdf" title="An Empirical Investigation of Metadata Utilization by Library Catalogers: The MARC Content Designation Utilization Project">An Empirical Investigation of Metadata Utilization by Library Catalogers: The MARC Content Designation Utilization Project</a></em></li>
<li>February 16, 2006. WebWise 2006 , Los Angeles, CA. <em><a href="http://www.mcdu.unt.edu/wp-content/WebWise2006Moen25Jan06.pdf" title="Examining MARC Records as Artifacts Reflecting Metadata Utilization Decisions">Examining MARC Records as Artifacts Reflecting Metadata Utilization Decisions</a></em></li>
<li>January, 2006. Association for Library and Information Science Education, San Antonio, TX. <em> <a href="http://www.mcdu.unt.edu/wp-content/ALISE2006Polyakovv2.pdf" title="Preparing, Creating, and Managing a Large Dataset of MARC 21 Records for Research and Analysis. [Poster session]">Preparing, Creating, and Managing a Large Dataset of MARC 21 Records for Research and Analysis. [Poster session]</a></em></li>
</ul>
<h4><u>Published Papers</u></h4>
<ul>
<li><em><a href="http://www.mcdu.unt.edu/wp-content/UnderstandingSupportOfFRBRUserTasks_ASISTBulletin_Aug07.pdf" title="Understanding Support of FRBRâ€™s Four User Tasks in MARC-Encoded Bibliographic Records">Understanding Support of FRBRâ€™s Four User Tasks in MARC-Encoded Bibliographic Records.</a></em> Bulletin of the American Society for Information Science and Technology, August/September 2007.</li>
<li><em><a href="http://www.firstmonday.org/issues/issue11_8/moen/" title="Examining MARC Records as Artifacts Reflecting Metadata Utilization Decisions">Examining MARC Records as Artifacts Reflecting Metadata Utilization Decisions.</a></em> First Monday, volume 11, number 8 (August 2006).</li>
<li><em><a href="http://www.mcdu.unt.edu/wp-content/ISKO06sdm23May2006REVISED.pdf" title="Metadata Assistance of the Functional Requirements for Bibliographic Records' Four User Tasks: a report on the MARC Content Designation Utilization (MCDU) Project">Metadata Assistance of the Functional Requirements for Bibliographic Records&#8217; Four User Tasks: a report on the MARC Content Designation Utilization (MCDU) Project.</a></em> [PrePrint]. Published in <u>Knowledge Organization for a Global Learning Society: Proceedings of the 9th International Conference for Knowledge Organization. International Society for Knowledge Organization 9th International Conference. (Vienna, Austria. July 5-7, 2006). Advances in Knowledge Organization, vol 10</u>. Wurzburg: Ergon.</li>
<li><em><a href="http://doi.acm.org/10.1145/1141753.1141813" title="Learning from Artifacts: Metadata Utilization Analysis">Learning from Artifacts: Metadata Utilization Analysis.</a></em>In <u>International Conference on Digital Libraries Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries. (Chapel Hill, NC. June 11-15, 2006)</u>: Pages: 270 - 271  .
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=40</wfw:commentRss>
		</item>
		<item>
		<title>In Memory of Henriette Avram, 1919-2006</title>
		<link>http://www.mcdu.unt.edu/?p=39</link>
		<comments>http://www.mcdu.unt.edu/?p=39#comments</comments>
		<pubDate>Fri, 23 Jun 2006 13:49:21 +0000</pubDate>
		<dc:creator>William Moen</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=39</guid>
		<description><![CDATA[The MCDU Project is dedicated to Henriette D. Avram, fondly known as the Mother of MARC. During the 1960s, Henriette let a team of librarians at the Library of Congress in a development effort that resulted in the Machine-Readable Catalog record. An obituary from the New York Times provides a wonderful portrayal of this amazing [...]]]></description>
			<content:encoded><![CDATA[<p>The MCDU Project is dedicated to Henriette D. Avram, fondly known as the Mother of MARC. During the 1960s, Henriette let a team of librarians at the Library of Congress in a development effort that resulted in the Machine-Readable Catalog record. An <a href="http://www.nytimes.com/2006/05/03/us/03avram.html?ex=1304308800&#038;en=60308f72d854136a&#038;ei=5088&#038;partner=rssnyt&#038;emc=rss">obituary from the New York Times</a> provides a wonderful portrayal of this amazing librarian and information scientist.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=39</wfw:commentRss>
		</item>
		<item>
		<title>MCDU Press Release, May 2005</title>
		<link>http://www.mcdu.unt.edu/?p=35</link>
		<comments>http://www.mcdu.unt.edu/?p=35#comments</comments>
		<pubDate>Thu, 12 May 2005 19:29:00 +0000</pubDate>
		<dc:creator>Gregory Snyder</dc:creator>
		
		<category><![CDATA[Documents]]></category>

		<guid isPermaLink="false">http://www.mcdu.unt.edu/?p=35</guid>
		<description><![CDATA[Contact:
William E. Moen, Ph.D.    (wemoen@unt.edu)
Shawne D. Miksa, Ph. D.    (smiksa@unt.edu)
Corrie Marsh   (cmarsh@unt.edu)
Texas Center for Digital Knowledge
PO Box 311068
Denton TX 76203
Tel: 940.565.4552
Fax: 940.565.3101
FOR IMMEDIATE RELEASE (11 May 2005)
LIBRARY CATALOG RECORDS UNDER THE MICROSCOPE
DENTON, TX – University of North Texas School of Library and Information Science (SLIS) Professors Dr. [...]]]></description>
			<content:encoded><![CDATA[<p>Contact:<br />
William E. Moen, Ph.D.    (wemoen@unt.edu)<br />
Shawne D. Miksa, Ph. D.    (smiksa@unt.edu)<br />
Corrie Marsh   (cmarsh@unt.edu)<br />
Texas Center for Digital Knowledge<br />
PO Box 311068<br />
Denton TX 76203<br />
Tel: 940.565.4552<br />
Fax: 940.565.3101</p>
<p><strong>FOR IMMEDIATE RELEASE (11 May 2005)</strong></p>
<p><strong>LIBRARY CATALOG RECORDS UNDER THE MICROSCOPE</strong></p>
<p><strong>DENTON, TX –</strong> <a href="http://www.unt.edu/slis/">University of North Texas School of Library and Information Science (SLIS)</a> Professors Dr. William E. Moen and Dr. Shawne D. Miksa are studying library catalog records, but not for the purpose of finding books. They are examining how books and other library materials are represented through electronic codes in online library catalogs. The project, entitled MARC Content Designation Utilization: Inquiry and Analysis, is the largest scientifically-based study of coding practices in electronic library catalogs. During the course of the 2-year project, Drs. Moen and Miksa, Fellows at the <a href="http://www.txcdk.org/">Texas Center for Digital Knowledge (TxCDK)</a>, will investigate the extent of catalogers’ use of MARC 21, the mark-up language used by catalogers worldwide to create electronic catalog records. SLIS Ph.D student Serhiy Polyakov and Masters students Amy Eklund and Gregory Snyder serve as Research Assistants on the project.</p>
<p>The <a href="http://www.imls.gov/">Institute of Museum and Library Services (IMLS)</a>, an independent Federal grant-making agency dedicated to creating and sustaining a nation of learners by helping libraries and museums serve their communities, is funding the project with a National Leadership Grant of $233,115 to the University of North Texas-TxCDK.</p>
<p>MARC (Machine Readable Cataloging) records provide bibliographic information and descriptions of items in library collections, including books, sound recordings, computer files, and visual materials.   The data elements of MARC records form the foundation of most electronic library catalogs used in North America today, as well as in libraries around the world.  Development of the MARC format was begun almost forty years ago by an initiative of the Library of Congress and evolved into the current format, MARC 21, which emerged in the late 1990s.  The MARC 21 format for bibliographic data is maintained by the Library of Congress&#8217;s Network Development and MARC Standards Office and the National Library of Canada&#8217;s Standards and Support Office.</p>
<p>To obtain catalog records for the study the researchers turned to the Online Computer Library Center (OCLC, <a href="http://www.oclc.org">www.oclc.org</a>), the largest online catalog record source in the world. This nonprofit computer library service and research organization maintains the WorldCat database which contains unique bibliographic records shared and contributed by more than 50,000 libraries in 84 countries and territories around the world. OCLC initially agreed to supply 1 million library catalog records from the WorldCat database for the researchers. After recent discussions, OCLC agreed to provide the researchers with all of its approximately 55 million catalog records. This new development will significantly increase the accuracy of the research results.</p>
<p>Dr. Moen explains that current MARC 21 specifications define nearly 2000 fields and subfields available to library catalogers working to create catalog records. In the 2003 IMLS-funded study entitled the Z-Interoperability project, Moen discovered that very few of these fields are being used. In fact, Moen discovered that only 36 of the available MARC fields accounted for 80% of all utilization. These preliminary findings have important implications for library catalogers and other library and information science professionals, and form the basis for the current study.</p>
<p>An important goal of the project is to create tools for the future study of catalog records. Dr. Miksa describes how this project will provide research strategies to examine MARC records as artifacts of the cataloging process. She emphasizes that resulting data will greatly inform cataloging education and curricula which is critical to the continued development and improvement of information retrieval systems in libraries worldwide.</p>
<p>The project’s findings ultimately will lead to improved access to information in library catalogs. Dr. Samantha Hastings, SLIS Interim Dean, describes this project as “just the type of funded research that leads to core developments in our field…the first of its kind that will be an important contribution to what we know about how content in library catalogs is actually being coded for organization and access.  How the information is organized directly influences how people get the information they need.”</p>
<p>Details of the project can be found at <a href="http://www.mcdu.unt.edu">http://www.mcdu.unt.edu</a>, a website created and maintained by SLIS Masters student Bryce Benton.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mcdu.unt.edu/?feed=rss2&amp;p=35</wfw:commentRss>
		</item>
	</channel>
</rss>
