Confluence is a team content collaboration software. Onna supports Confluence Cloud and Server version 5.7 and up. Onna connects directly with the API to collect all information in native format. The integration collects all data and metadata from an entire Confluence site or individual spaces.
To collect from Confluence on-premise you will need Onna's Discovery app and an Onna Enterprise account. For security reasons, we will also need the domain for the confluence site. Please email firstname.lastname@example.org with a request for the Discovery App and the domain for the collection. The turnaround on this process can be up to 24 hours.
All files are synced, including, but not limited to:
- HTML content of the page
- Comments on pages
- Attachments for the page
- Labels for attachments and pages
- Ancestors for the page/attachments
- Historical information and related metadata, including:
- Author of the page
- Created by/on
- Last updated by/on
- Previous Version created by/on
Types of Sync Available
For on-premise collections we only support one-time sync
- One-time sync collects information in an account until a specified date. It does not update once collected.
The synchronization scope currently encompasses entire Confluence sites, specific Confluence spaces, and specific Confluence pages.
All files and metadata can be exported in eDiscovery ready format. Load files are available in a dat, CSV, or custom text file.
The following metadata fields are exported:
- Space Name
- Space ID (numeric field to identify space in Confluence)
- Confluence Space Type
- Ancestors for a file
- List of Labels
- All date related metadata
How to Guide
First, install the app on a machine that is behind the desired firewall and that has constant connectivity to the Confluence server and Internet.
Note: Generally, a server, virtual or physical, is preferred over a desktop or laptop unless the machine will remain unlocked and have no interruptions to its connectivity. The app needs to be installed by a user with admin level access to that machine. If you have 2FA enabled for your Confluence site you may need to create a new account without it enabled.
The app will open onto a login screen similar to the platform's login.
After logging in with the same credentials you would use in the web platform, the app will open to your Workspaces page
Currently the workflow you'll have to follow is either:
- Creating a new Workspace for your collection
- Using an existing Workspace to add a data source
Inside the workspace, next click "Add new Source"
Currently, you can use the app to add a Confluence or Jira source.
First, name your source. This is the source's title on the platform. If you're naming for eDiscovery purposes a common convention is to name it after the company.
Enter the Confluence site's URL as the host. If the site is password-protected, enter your credentials here, including your full username's email. If the site is public, leave username and password blank. (See example below for collection from a public site). Once you've finished entering the details, click 'Connect'.
Select the space(s) you would like to sync. To sync all, select "All Spaces".
Once you have clicked "Sync", you will see this integration within your Groups page. You will also see it within your Sources page on the web platform.
Onna will begin to interact with Confluence's API and begin to sync files. Files will be processed and indexed so that all is searchable. A source will indicate that it's syncing during this process.
When you click on the Confluence data source, you will start seeing results being populated.
From this screen, you are able to filter results by date range, categories, and/or extensions using the menu on the left.
Confluence pages in Onna
For on-premise Confluence collections we render the pages collected in HTML.
Accessing audit logs
Clicking on the information icon on the top right will take you to the source details where you can see how many files it has and its size. Click on Audits to be taken to the Audit logs screen:
For Confluence on-premise collections, is it necessary to install anything on a server?
Yes, one needs to install an application on a Windows machine that is always on and has constant connectivity to the Confluence server and Internet.
Where will the information be stored for an on-premise collection?
This depends on the type of Onna deployment you're using. If you're using the cloud version of the platform then the information that you collect using the app will be uploaded to that Onna environment.
What type of login is needed - database or user?
A user account to Confluence with full access to the space(s) that need to be collected.