Information Architecture of Content Management
When people think about content management, they generally think about it from a systems perspective, focusing primarily on tools and technology. While it is true that content management usually requires a technological solution, it also requires that content be designed for reuse, retrieval, and delivery to meet your authors' and customers' needs. Content management requires that tools be configured to support authoring, reviewing, and publishing tasks, but first, those tasks must be designed. Designing content and the processes to create, review, and publish it is what information architecture is all about.
Information architecture has become synonymous with information architecture for the web. However, as more organizations are adopting content management systems to manage both web and enterprise content, there is a new area of information architecture emerging—the information architecture of content management. One of the key factors for a successful content management implementation is a solid information architecture. Too often organizations implement content management without identifying the authors' needs, without looking closely at the content to determine how it could be most effectively structured to support user or customer needs, and without analyzing their current and desired content life cycle. This results in resistance to adoption, increased costs, and failure to achieve the desired results. Information architecture can make a significant contribution to the success of your content management solution.
This is a view supported by Lou Rosenfeld (http://www.louisrosenfeld.com), an information architecture consultant and co-author of Information Architecture for the World Wide Web: Designing Large-Scale Web Sites [1]. Rosenfeld has been instrumental in establishing the industry of information architecture for the web and points out:
"When it comes to making content accessible, content management and information architecture are two sides of the same coin. Authors and end users alike, benefit from intelligent design and well-organized processes." [2]
People like Lou Rosenfeld, Peter Morville, Christina Wodtke and others in the information architecture and information design industry have laid the groundwork for a move to information architecture for content management beyond the web.
The components of information architecture for content management
There are a number of components of information architecture that are key in building a solid base for a content management implementation. They include analysis, content models, granularity, metadata, reuse and repository architectures, reuse management, and content control.
Analysis
Good information architecture requires that you start with a thorough analysis of your organizational needs, your current and desired content life cycle, your customers' needs, the state of your current content, and your technological requirements. During the analysis phase, you need to look at your content very closely to determine how it's put together and the types of content it contains. This will help you to determine opportunities for reuse. You also need to talk to the people who create and use the content to learn what their issues are. This will help you to determine problem areas in work processes that can be addressed in workflow.
Content models
One of the most critical phases of your information architecture is building the content models on which your content management strategy is based. Content modeling involves identifying and documenting the structure of your content in detail. During the content modeling phase, you determine the elements required for each information product (or output) and how each information product will be designed for optimum usability and reuse. Content models define the structure and organization of your information products, indicating which individual elements they contain, their frequency, and their usage (for example, is an element optional or mandatory). Models become the road map for your content and are used to develop DTDs and schemas (if you are using XML), or content frameworks and templates.
Granularity of content
Designing the granularity of your content can sometimes be problematic. Authors typically like content very granular so they know exactly what to put into an element (for example, overview, procedure step). Very granular content usually results from more semantic models (models with tags that indicate the meaning of the element such as “overview” instead of tags with generic names such as “body” or “para”). Highly semantic models are more problematic for style sheet designers because all unique elements require an individual style. Because semantic names by their nature are unique, all semantically-named elements require their own styles.
Granularity also affects how you reuse content. Content that is too granular can be difficult to manage in your content management system, but content that is not granular enough may not be as reusable. Accordingly, CMS developers may push back on the level of granularity, opting for content that is not granular. Analysis of reusability, authoring processes, and tools is important when determining granularity and as you develop your information architecture, you will make changes to your granularity as you determine the optimum level of granularity for everyone.
Metadata
There are typically two types of metadata: categorization metadata and element metadata. Users tend to retrieve information based on categorization metadata, whereas authors tend to retrieve information based on element metadata.
Categorization metadata is used extensively on websites to categorize content for effective retrieval. It is also used extensively in document management to classify documents for storage.
Authors, on the other hand, use element metadata to classify elements of content for reuse, retrieval, and tracking. Care should be taken to ensure that you can retrieve your elements once stored. Your ability to reuse information is only as good as your ability to find it. And if you employ systematic reuse (see Reuse architecture) your metadata must be very thorough so that the system can correctly find and populate the content into the required information products and into the required places within information products. Like granularity, metadata design also continues to develop as you refine your architecture.
Reuse architecture
Content can be reused within an information product, across information products, and potentially across the enterprise. Traditionally, the most common form of reuse has been opportunistic, meaning that authors make a decision whether to reuse content or not. However, opportunistic reuse is also the least efficient because it requires that authors know a reusable element exists and what it is called, then find the element and reuse it in their information product. In addition, if authors are not aware that an element already exists, they may recreate it causing multiple elements to proliferate in your content management system. This also makes it difficult to know which of the multiple elements is the definitive one.
Alternatively, systematic reuse is automatic reuse. Once specific content has been identified as reusable in a specific location, it is automatically inserted (auto-populated) into the appropriate locations. Authors do not have to determine if the reusable content exists or search for, retrieve it, and insert it into the appropriate places. Systematic reuse ensures that content is automatically reused where necessary, thus reducing the burden on authors. When designing your reuse architecture, considerable analysis of information products is required to decide which elements are systematically reusable and where.
Once you've decided which elements are systematically reusable, you create content and structure reuse maps as part of your reuse architecture. The content reuse maps identify where content can and should be reused and if it should be reused identically or can be used derivatively (with change). Content reuse maps are used by your content management system to programmatically (automatically) ensure that content is reused.
In addition to identifying content reuse, you need to identify structural reuse as part of your reuse architecture. Structural reuse identifies where common structures are reused. For example, you might have a product description element in a brochure, but you would also have a product description element on the web. Even though those product description elements may be structurally the same, they may contain different content. Structure reuse maps are used by DTD/template developers in creating consistent structures for authors to follow.
Repository architecture
The repository architecture defines how you will structure your repository. For example you may have “building block” directories that include content that is frequently reused (for example, glossary, procedures, product descriptions) and the remainder of your content stored in information product directories (for example, all brochures) that are further organized by product. Or you may decide to organize your content by product with each of the information products as a subset of the product. You need to determine what is the most effective repository structure for your needs. Note, however, that the identified structure is not a physical file structure. Content is stored in the database, not in directories. The repository structure enables your authors to easily find information.
Reuse management
An area of information architecture that is frequently overlooked is that of reuse management. If authors opportunistically reuse content and create derivatives of the content, it quickly becomes difficult to identify which element is the definitive one. Your content management system will end up looking like your current file structure and you will have no clear idea of what is source content, where content is reused, and if there are multiple versions of the same piece of content. Reuse management means creating rules to manage your reusable content. The reuse rules are formalized in your content management system through workflow and in your system configuration.
Content control
Content control, as part of your information architecture, identifies how your content should be managed. You need to determine how content should be controlled through its life cycle and what security should be applied to it. Content control is tightly integrated with your reuse management strategy and business practices and like reuse management, it is formalized in workflow.
Training
Training is key to content management success. Information architecture and content management are specific disciplines that require specialized knowledge and training. The Rockley Group Content Management Workshop Series is designed to provide you with a comprehensive understanding of the concepts and techniques required to develop and implement effective content management. Learn more here: http://www.rockley.com/workshops.htm.
Need for integration of IA and content management
Bob Boiko (http://www.metatorial.com), Director of the University of Washington's School Content management system evaluation lab, content management expert, and best-selling author of the book Content Management Bible [3], sums up the discussion of information architecture and content management very well:
“Content management is the dynamic organization of information architecture, business management, software and network engineering, content creation, and publications development. If you don't master each of these areas, CM will fail.”
If you don't get them to integrate, CM will fail. Information architecture is the structuring of information for effective management and presentation. While the discipline has focused more to date on the presentation side of structure, it is now turning solidly toward management. As it does, the tight connection between content management and information architecture is becoming clear. Information architects, like the building architects before them, create structures. They lay the foundations under and the frames around information. Content managers gather and dynamically deliver masses of information. Without a solid information structure at the core, a CMS effort can't get off the ground. At best, it will be hugely inefficient and at worst it will crumble under its own weight. Information architects have the skills to structure a content domain so that information can flow in a reasoned and efficient way. It flows in according to well understood rules of relevance, segmentation and tagging, and it flows out according to well understood rules of audience interest and use.
So, CM needs IA. But IA needs CM as well. CM provides a wider context for IA. It makes IA not just about the best page, or even the best site, but rather about the best system behind all the pages, sites and myriad other outlets for information. CM centralizes IA in the organization. It pushes IA upstream toward the center of the organizational information systems infrastructure. CM integrates IA with business management, software and network engineering, content creation, and its old friend publications development toward establishing a new concept of what it means to be an organization in the information age. [4]
References
1. Rosenfeld, Louis and Peter Morville. Information Architecture for the World Wide Web: Designing Large-Scale Web Sites. 2002. O'Reilly and Associates, 2nd Edition.
2. Rosenfeld, L. Email interview, February 2004.
3. Boiko, Bob. Content Management Bible. 2001. John Wiley & Sons.
4. Boiko, B. Email interview, February 2004.
About the author
Ann Rockley, president of The Rockley Group, Inc., has an international reputation in the field of content management. She is a fellow of the Society for Technical Communication, has a Master of Information of Science from the University of Toronto, and teaches Enterprise Content Management at the University of Toronto. Rockley is also involved in the founding of Content Management Professionals.
About The Rockley Group: The Rockley Group helps content managers and authors meet the increasing demands of creating, distributing and managing the content they create. Our team of experienced analysts bring a wide variety of expertise to the table and can help you avoid expensive pitfalls. Organizations of all sizes -- from small, privately-owned firms to multi-national Fortune 500 companies -- trust us with their most important content projects. We serve clients in the Financial, Life Sciences, and High Technology industries, as well as others in the Communication, Marketing and Retail Sales markets. We’ve developed content reuse solutions that reduce the cost and effort to produce complex information products including: marketing collateral, software documentation, online help, customer support materials, human resources content, as well as regulatory documents for pharmaceutical and medical device manufacturers.
