Configuring Link Manager Managing Linked Content with Link Manager

Managing Repository Content 2-25

■ Microsoft Excel
■ OpenOffice Writer
■ OpenOffice Calc

2.6.1.3 Link Status

All new and existing links are managed during the indexing cycle. When content items are checked in, the accepted links in these content items are added to or updated in the ManagedLinks table. Additionally, existing links are evaluated for changes resulting from content items being checked in or deleted.

As links are added or monitored, they are marked as either valid or invalid. When one content item in the system references another content item in the system, the resulting link is marked as valid. When an existing link references a content item that has been deleted, the link is reevaluated and its status changes from valid to invalid.

Statuses are recorded as Y (valid) or N (invalid) in the dLkState column of the ManagedLinks table and displayed for the user in the State column of the Link Info page as Valid or Invalid. For more information on the Link Info page, see the Oracle Fusion Middleware User's Guide for Content Server.
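The status rules above can be sketched in a few lines of Python. This is an illustration only, not Link Manager's implementation: the function names and the in-memory set of content IDs are hypothetical, and only the Y/N values of dLkState and the Valid/Invalid labels come from the text.

```python
def link_state(target_id, existing_ids):
    """Return a dLkState-style flag: 'Y' if the link target exists, else 'N'."""
    return "Y" if target_id in existing_ids else "N"


def display_state(dlk_state):
    """Map the stored flag to the label shown in the State column."""
    return {"Y": "Valid", "N": "Invalid"}[dlk_state]


# Hypothetical repository holding two content items.
existing = {"doc_a", "doc_b"}
before = link_state("doc_b", existing)   # the target exists, so the link is valid

existing.discard("doc_b")                # the target is deleted
after = link_state("doc_b", existing)    # reevaluation marks the link invalid

print(display_state(before), display_state(after))
```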

2.6.2 Configuring Link Manager

The following Link Manager configuration variables can be specified in the IntradocDir/config/config.cfg file:

■ AllowForceDelete
■ HasSiteStudio
■ LkRefreshBatchSize
■ LkRefreshErrorsAllowed
■ LkRefreshErrorPercent
■ LkRefreshErrorThreshold
■ LkDisableOnRebuild
■ LkDisallowConversionFormats
■ LkReExtractOnRebuild
■ LkIsSecureSearch
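As a rough illustration of how these Name=value entries combine with their defaults, the sketch below parses a sample fragment in Python. It is not how Content Server reads its configuration: the parser, the sample values, and the treatment of lines starting with # as comments are assumptions. The default values are the ones documented in the subsections that follow; LkRefreshErrorsAllowed and LkDisallowConversionFormats are omitted because no default is given for them.

```python
# Defaults as documented in sections 2.6.2.1 through 2.6.2.10.
DEFAULTS = {
    "AllowForceDelete": "false",
    "HasSiteStudio": "false",
    "LkRefreshBatchSize": "100",
    "LkRefreshErrorPercent": "10",
    "LkRefreshErrorThreshold": "100",
    "LkDisableOnRebuild": "false",
    "LkReExtractOnRebuild": "true",
    "LkIsSecureSearch": "true",
}


def parse_config(text):
    """Overlay Name=value lines on the defaults (hypothetical helper)."""
    settings = dict(DEFAULTS)
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, comment lines (assumed to start with '#'), and malformed lines.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, value = line.split("=", 1)
        settings[name.strip()] = value.strip()
    return settings


# Hypothetical config.cfg fragment.
sample = """
AllowForceDelete=true
LkRefreshBatchSize=50
"""

settings = parse_config(sample)
print(settings["AllowForceDelete"], settings["LkRefreshBatchSize"], settings["LkIsSecureSearch"])
```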

2.6.2.1 AllowForceDelete

Use AllowForceDelete to allow Content Server to delete content items that are referenced or used by other content items.

■ AllowForceDelete=true allows Content Server to delete a content item that is referenced as a link. Deleting a content item that is referenced by other content items invalidates the links.
■ AllowForceDelete=false configures Content Server to refuse a request to delete a content item that is referenced as a link. This is the default setting.

If the value is not set to true, a request to delete a referenced content item will fail.
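The two behaviors can be modeled with a small Python sketch. The data structures, the function, and the exception are hypothetical stand-ins for Content Server internals; the sketch only mirrors the rule described above, in which a referenced item is refused unless force delete is enabled and a forced delete invalidates the referencing links.

```python
class DeleteRefused(Exception):
    """Raised when a referenced item is deleted without force delete enabled."""


def delete_item(item_id, items, links, allow_force_delete=False):
    """Delete item_id from the set `items`; return how many links were invalidated."""
    referencing = [link for link in links if link["target"] == item_id]
    if referencing and not allow_force_delete:
        raise DeleteRefused(f"{item_id} is referenced by {len(referencing)} link(s)")
    items.discard(item_id)
    for link in referencing:
        link["state"] = "N"  # the link now points at a deleted item
    return len(referencing)


items = {"guide", "logo"}
links = [{"source": "guide", "target": "logo", "state": "Y"}]

try:
    delete_item("logo", items, links)  # default behavior: the request is refused
except DeleteRefused as exc:
    print("refused:", exc)

delete_item("logo", items, links, allow_force_delete=True)
print(links[0]["state"])
```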

2.6.2.2 HasSiteStudio

If Site Studio is installed, use HasSiteStudio to enable the Site Studio-specific link patterns that the pattern engine uses to parse friendly URLs. For more detailed information about the current integration of Site Studio with Link Manager, refer to Site Studio Integration on page 2-33.

■ HasSiteStudio=true enables the Site Studio-specific link patterns.
■ HasSiteStudio=false disables the Site Studio-specific link patterns. This is the default setting.

2.6.2.3 LkRefreshBatchSize

Use LkRefreshBatchSize to control the number of links submitted to Site Studio during a refresh process. By default, the batch size value is set to 100.

Depending on the value, the LkRefreshBatchSize configuration variable can make the refresh processes more or less receptive to an abort request. The lower the value, the more responsive the abort option (see Aborting the Refresh Activity) is for the current refresh process. Also, the calculation for LkRefreshErrorsAllowed is completed after, rather than during, each batch process. Therefore, the lower the value for LkRefreshBatchSize, the more likely it is that the refresh activity will terminate promptly, because the number of allowed errors is exceeded sooner.

LkRefreshBatchSize=number

For more information about the refresh processes available with Link Manager, refer to Managed Links Administration Page on page 2-35.

Batch size affects abort behavior because, during refresh activities, all the links recognized as Site Studio links or those requiring Site Studio processing are grouped and sent to Site Studio as a batch. Consequently, refreshes are more efficient, but abort requests are unresponsive during this time because Site Studio is unaware of aborts and total error counts. However, Site Studio is aware of the number of errors it has encountered in the current batch. For this reason, Link Manager's abort calculation cannot take place in all situations, and the error configuration values (percent and threshold) are simply suggestions to Link Manager for when an abort should occur. Using LkRefreshBatchSize, however, enables you to control the abort receptivity more accurately during refresh activities that involve Site Studio link batches.
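The effect of batch size on abort responsiveness can be seen in a small simulation. The Python sketch below is a simplified model under stated assumptions, not the actual refresh code: the refresh function, the process_batch callback, and the "count reached" comparison are all hypothetical. It captures only the point made above, that abort conditions are checked after each batch rather than inside it.

```python
def refresh(links, process_batch, batch_size=100, errors_allowed=None,
            abort_requested=lambda: False):
    """Process links in batches; abort conditions are checked only between batches."""
    processed = errors = 0
    for start in range(0, len(links), batch_size):
        batch = links[start:start + batch_size]
        errors += process_batch(batch)   # a whole batch runs without interruption
        processed += len(batch)
        if abort_requested():
            return processed, errors, "aborted by request"
        # Assumption: the refresh aborts once the allowed error count is reached.
        if errors_allowed is not None and errors >= errors_allowed:
            return processed, errors, "aborted on errors"
    return processed, errors, "completed"


# Hypothetical workload: 1000 links, every tenth one fails.
links = list(range(1000))
count_failures = lambda batch: sum(1 for n in batch if n % 10 == 0)

large = refresh(links, count_failures, batch_size=100, errors_allowed=5)
small = refresh(links, count_failures, batch_size=20, errors_allowed=5)
print(large)  # stops only after a full batch of 100 links
print(small)  # stops after 60 links
```

With the same error limit, the smaller batch size stops the refresh after fewer links and fewer errors, because the check runs more often.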

2.6.2.4 LkRefreshErrorsAllowed

Use LkRefreshErrorsAllowed to set an absolute error count for refresh processes. Once the set number of errors is encountered, the refresh activity aborts. By default, this configuration setting is not used.

LkRefreshErrorsAllowed=number

For more information about the refresh processes available with Link Manager, refer to Managed Links Administration Page on page 2-35.

Note: If you are using Site Studio, LkRefreshBatchSize gives you better control over the refresh abort option than the combined LkRefreshErrorPercent and LkRefreshErrorThreshold settings. For example, if you set the percent value to 5 and the threshold value to 20, you would expect the refresh activity to abort after the first error. However, Link Manager might actually process more than one error before aborting.

Note: If you choose to set a value for the LkRefreshErrorsAllowed configuration setting, it could override the combination of values for threshold (LkRefreshErrorThreshold) and percent (LkRefreshErrorPercent). For example, if the LkRefreshErrorsAllowed value is less than the calculated threshold/percent value for links/errors, then the refresh activity could abort even if the number of links processed does not exceed the threshold limit. Therefore, it is recommended that you use either the LkRefreshErrorsAllowed configuration setting or the combination of the LkRefreshErrorThreshold and LkRefreshErrorPercent configuration settings.

2.6.2.5 LkRefreshErrorPercent

This value is used with LkRefreshErrorThreshold to compute whether the refresh activity should be aborted. When the refresh activity processes more links than the threshold value, Link Manager calculates the proportion of errors. If the proportion of errors exceeds the percent value, the refresh activity is aborted. For example, if the threshold value is 300 and the percent value is 20, the refresh activity will abort if 60 errors (20 percent of 300 links) are encountered after processing more than 300 links. By default, the percent value is set to 10.

LkRefreshErrorPercent=number

For more information about the refresh processes available with Link Manager, refer to Managed Links Administration Page on page 2-35.

Note: If you choose to set a value for the LkRefreshErrorsAllowed configuration setting, it could override the combination of values for threshold (LkRefreshErrorThreshold) and percent (LkRefreshErrorPercent). For example, if the LkRefreshErrorsAllowed value is less than the calculated threshold/percent value for links/errors, then the refresh activity could abort even if the number of links processed does not exceed the threshold limit. Therefore, it is recommended that you use either the LkRefreshErrorsAllowed configuration setting or the combination of the LkRefreshErrorThreshold and LkRefreshErrorPercent configuration settings.
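One reading of the threshold and percent calculation can be written as a short Python function. This is a sketch under assumptions: the function name is hypothetical, and the guide does not state whether the comparison is strict or whether the proportion is taken over all links processed so far, so both choices below are guesses.

```python
def should_abort(processed, errors, threshold=100, percent=10):
    """Decide whether a refresh should abort under the threshold/percent rule.

    Assumptions: no check is made until more than `threshold` links have been
    processed, and the abort fires when errors make up strictly more than
    `percent` percent of the links processed so far.
    """
    if processed <= threshold:
        return False
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return errors * 100 > percent * processed


# Threshold 300, percent 20, as in the example above.
print(should_abort(300, 60, threshold=300, percent=20))  # threshold not yet exceeded
print(should_abort(301, 61, threshold=300, percent=20))  # more than 20 percent errors
print(should_abort(301, 10, threshold=300, percent=20))  # error rate is low
```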

2.6.2.6 LkRefreshErrorThreshold

This value is used with LkRefreshErrorPercent to compute whether the refresh activity should be aborted. The LkRefreshErrorThreshold value indicates how many links need to be processed before the percentage of errors is calculated. Depending on the proportion of errors that are detected, the refresh activity may need to be aborted. For example, if the threshold value is 300 and the percent value is 20, the refresh activity will abort if 60 errors (20 percent of 300 links) are encountered after processing more than 300 links. By default, the threshold value is set to 100 and the percent value is set to 10.

LkRefreshErrorThreshold=number

For more information about the refresh processes available with Link Manager, refer to Managed Links Administration Page on page 2-35.

Note: If you choose to set a value for the LkRefreshErrorsAllowed configuration setting, it could override the combination of values for threshold (LkRefreshErrorThreshold) and percent (LkRefreshErrorPercent). For example, if the LkRefreshErrorsAllowed value is less than the calculated threshold/percent value for links/errors, then the refresh activity could abort even if the number of links processed does not exceed the threshold limit. Therefore, it is recommended that you use either the LkRefreshErrorsAllowed configuration setting or the combination of the LkRefreshErrorThreshold and LkRefreshErrorPercent configuration settings.
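The override described in the note can be made concrete with a hypothetical Python model in which both checks run side by side. The function name, the order of the checks, and the comparison operators are assumptions made for illustration; the guide does not specify them.

```python
def abort_reason(processed, errors, errors_allowed=None, threshold=100, percent=10):
    """Return which setting would trigger an abort, or None if the refresh continues."""
    # Absolute count: applies regardless of how many links have been processed.
    if errors_allowed is not None and errors >= errors_allowed:
        return "LkRefreshErrorsAllowed"
    # Threshold/percent: evaluated only once the threshold has been exceeded.
    if processed > threshold and errors * 100 > percent * processed:
        return "LkRefreshErrorThreshold/LkRefreshErrorPercent"
    return None


# 50 links processed, 10 errors, threshold 300, percent 20.
without_count = abort_reason(50, 10, threshold=300, percent=20)
with_count = abort_reason(50, 10, errors_allowed=10, threshold=300, percent=20)
print(without_count)  # the threshold has not been reached, so the refresh continues
print(with_count)     # the absolute count aborts the refresh anyway
```

In this model the absolute count stops the refresh well before the threshold of 300 links is reached, which is why the guide recommends choosing one mechanism or the other.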

2.6.2.7 LkDisableOnRebuild

Use LkDisableOnRebuild to control link extraction during the indexing rebuild cycle.

■ LkDisableOnRebuild=true prevents Link Manager from extracting links during the indexing rebuild cycle.
■ LkDisableOnRebuild=false configures Link Manager to extract links during the indexing rebuild cycle. This is the default setting.

Because Link Manager does all of its work during the indexing cycle, it increases the amount of time required to index documents and to rebuild collections. However, the additional time may not be noticeable, because most of the time is spent indexing the document into the collection. The amount of time required also depends on the type and size of the documents involved: a file that needs to be converted requires more time than a text-based HTML file. For more information about file formats, conversion, and link extraction, refer to Link Extraction Process on page 2-23 and File Formats and Conversion on page 2-24.

2.6.2.8 LkDisallowConversionFormats

Use LkDisallowConversionFormats to provide a list of file formats that will not be converted using HtmlExport before being processed for link extraction. Various file formats, such as Word, need to be converted by HtmlExport before links can be extracted. However, links in text-based file formats (such as HTML) can be extracted by Link Manager without conversion by HtmlExport. For example, you may have PHP files or files in some other custom format that are actually text-based. For such files, conversion by HtmlExport may not be necessary before Link Manager processes them for link extraction. Such formats can be listed in this configuration variable.

LkDisallowConversionFormats=format,...,format

Where format is the MIME type.

Example 1

LkDisallowConversionFormats=application/msword,audio/wav,video/avi

In this example, specifying the full MIME type limits the variations of the listed types that are excluded. For example, listing application/msword will not exclude application/vnd.msword or other variations. In this case, you must include in the list each specific MIME type variation that you want to exclude.

Example 2

LkDisallowConversionFormats=msword,wav,avi

In this example, all variations of each MIME type in the list are excluded. The shortened form of the MIME type is more flexible because it matches every variation.

Link Manager does not use HtmlExport to convert files that contain any of the following strings in the file format: hcs, htm, image, text, xml, jsp, and asp. These files are handled by Link Manager without need for conversion. To check the current file formats and extension mappings of your system, use the File Formats window in the Configuration Manager.
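The difference between the two examples is consistent with substring matching against the file format. The Python sketch below assumes exactly that; the function names are hypothetical, and case-insensitive comparison is an added assumption that the guide does not confirm.

```python
# Strings that, per the text above, mark formats handled without conversion.
BUILT_IN_SKIP = ("hcs", "htm", "image", "text", "xml", "jsp", "asp")


def parse_formats(value):
    """Split a LkDisallowConversionFormats value into lowercase entries."""
    return [entry.strip().lower() for entry in value.split(",") if entry.strip()]


def conversion_allowed(file_format, disallowed):
    """Return True if a file of this format would still go through conversion.

    Assumption: an entry excludes a format when it occurs anywhere inside
    the format string (substring match).
    """
    fmt = file_format.lower()
    if any(token in fmt for token in BUILT_IN_SKIP):
        return False
    return not any(entry in fmt for entry in disallowed)


full = parse_formats("application/msword,audio/wav,video/avi")   # Example 1
short = parse_formats("msword,wav,avi")                          # Example 2

print(conversion_allowed("application/msword", full))        # excluded by the exact entry
print(conversion_allowed("application/vnd.msword", full))    # variation not listed, still converted
print(conversion_allowed("application/vnd.msword", short))   # excluded by the short entry
print(conversion_allowed("text/html", []))                   # built-in text-based format
```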

2.6.2.9 LkReExtractOnRebuild

Use LkReExtractOnRebuild to control link extraction from previously indexed documents during a rebuild.

■ LkReExtractOnRebuild=true configures Link Manager to extract links from documents as they are indexed into the system during a rebuild. This is the default setting.
■ LkReExtractOnRebuild=false prevents Link Manager from extracting links from documents as they are indexed into the system during a rebuild.

For more information about file formats, conversion, and link extraction, refer to Link Extraction Process on page 2-23 and File Formats and Conversion on page 2-24.

2.6.2.10 LkIsSecureSearch

Use LkIsSecureSearch to manage security restriction checks for non-admin users during a link search. To implement the security check, Link Manager performs an inner join between the Revisions table and the ManagedLinks table, and applies standard Content Server security using security groups and accounts.

■ LkIsSecureSearch=true configures Link Manager to perform a security check on non-admin users during a managed link search. This is the default setting.
■ LkIsSecureSearch=false prevents Link Manager from performing a security check on non-admin users during a managed link search.

Example

If the document containing the link is Secure and the linked document is Public, a user without permissions to Secure will not be able to find the document containing the link in the Search Links results.

If the document containing the link is Public and the linked document is Secure, the document containing the link will be found in the Search Links results, but the same user will not be able to use the link (that is, if it is a link to the web-viewable file or to the document information for the secure document).
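The join-based check can be approximated with Python and an in-memory SQLite database. This is a simplified sketch, not the query Link Manager runs: the table layouts and every column name except dLkState are hypothetical, accounts are left out, and the rows are invented to mirror the example above.

```python
import sqlite3


def search_links(conn, user_groups, secure=True):
    """List (containing document, link target, state) rows visible to a user."""
    sql = ("SELECT r.doc_name, m.target_name, m.dLkState "
           "FROM ManagedLinks m "
           "INNER JOIN Revisions r ON r.doc_id = m.container_id")
    params = []
    if secure:
        if not user_groups:
            return []
        # Filter on the security group of the document that contains the link.
        placeholders = ",".join("?" * len(user_groups))
        sql += f" WHERE r.security_group IN ({placeholders})"
        params = list(user_groups)
    sql += " ORDER BY r.doc_name"
    return conn.execute(sql, params).fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Revisions (doc_id INTEGER PRIMARY KEY, doc_name TEXT, security_group TEXT);
CREATE TABLE ManagedLinks (container_id INTEGER, target_name TEXT, dLkState TEXT);
INSERT INTO Revisions VALUES (1, 'secure_report', 'Secure'), (2, 'public_page', 'Public');
INSERT INTO ManagedLinks VALUES (1, 'public_page', 'Y'), (2, 'secure_report', 'Y');
""")

public_only = search_links(conn, ["Public"])            # user without access to Secure
unchecked = search_links(conn, ["Public"], secure=False)
print(public_only)
print(unchecked)
```

The user limited to Public finds only the link held in the public document, while the unchecked search returns both links. Whether that user can then follow the link to the secure target is a separate permission check that this sketch does not model.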

2.6.3 Managing Links