SharePoint Translation Service for Workplace Paperwork and PDFs
On February 14, 2024, Microsoft revealed particulars of the SharePoint On-line translation service, one of many capabilities obtainable by way of the SharePoint Premium license. The article included a pleasant provide for tenants to check SharePoint Translation with a promotion obtainable till the top of June 2024.
The provide covers translation of the primary a million characters without charge each month. After this level, the interpretation service prices $15.00 for 1 million characters. Microsoft estimates that 1 million characters to be roughly 500-750 pages relying on the density of the textual content. Like Microsoft 365 Backup and different SharePoint add-on companies, charging is on a pay-as-you-go (PAYG) foundation billed by way of a month-to-month Azure subscription. Earlier than utilizing a PAYG service, it’s essential to configure the Microsoft Syntex service settings to hyperlink the Azure subscription. After that, all it’s essential do is determine which websites to make translation obtainable. By default, translation is out there for all websites, however you possibly can limit the ability to as much as 100 particular websites (Determine 1).

In accordance with the documentation, the supported file varieties embody csv, .docx, .htm, .html, .markdown, .md, .msg, .pdf, .pptx, .txt, and .xlsx. Legacy Workplace codecs are supported too. SharePoint Translation creates the translated variations of recordsdata in these codecs within the equal “fashionable” format (for instance, .doc turns into .docx). The utmost supported file dimension for translation is 40 MB.
Like many bulletins for Microsoft 365 companies, it takes time for all the mandatory bits to indicate up in a tenant. I used to be capable of see a Translate possibility for paperwork just a few weeks in the past, however taking the choice did nothing. It’s solely within the final week or in order that translation works.
Translating Phrase Paperwork
I work on a number of paperwork day by day, principally to write down articles and chapter recordsdata for books. All of the recordsdata are formatted for U.S. English as a result of that’s the language used for publication on web sites or in eBooks. Most articles are Phrase paperwork of between 800 and 1,500 (2 to five pages). Translation of those recordsdata is simple – choose the doc, select the Translate possibility from the […] menu, and choose a goal language (Determine 2).

The drop-down menu exhibits a listing of the preferred languages. It’s doable to translate to any of the supported languages by specifying the language’s ISO code (for instance, ca for Catalan or da for Danish). If SharePoint search helps the language, it expands the language code and lets you submit the file for translation.
Normally, translation occurs in a short time. Generally issues didn’t occur fairly so quick, and I needed to await a number of hours earlier than a translated file appeared. And typically, translation merely didn’t work. You’ll be able to perceive that background processes working in a cloud service typically take longer than anticipated, however it’s disconcerting when nothing occurs after submitting a file for translation.
When the whole lot works, the translated file seems within the doc library the place the supply doc is saved. The translated file has the identical title because the supply doc appended with the ISO code for the goal language. For instance, the file for the French language translation for a doc named Translating SharePoint On-line Paperwork.docx is Translating SharePoint On-line Documents_fr.docx. The translated file inherits the properties of its supply, together with retention and sensitivity labels (if utilized).
No Translation for Protected Information
Talking of sensitivity labels, translation doesn’t work if the label utilized to a file consists of safety (encryption). Information with labels marking data with a sure sensitivity could be translated, however as soon as encryption is concerned, translation doesn’t work. Although SharePoint On-line shops protected recordsdata in an unprotected type and solely applies encryption when a file is downloaded, that is fully logical. To translate a file, SharePoint downloads it and sends it to the interpretation service. The sensitivity label that controls entry to the file doesn’t embody an entry proper for the interpretation service, so the service can’t open and course of the content material. The identical restriction applies to password-protected recordsdata. The dangerous factor is that customers can submit protected recordsdata for processing with none indication thereafter that translation is inconceivable.
Aside from noticing the presence of a newly translated file, customers or directors don’t obtain some other notification indicating that the interpretation was profitable. It might be good if customers may determine if they need to obtain notifications by way of e-mail, particularly when massive paperwork or massive portions of paperwork are processed.
Translated Output
Translation of Phrase paperwork preserves headings, textual content, headers, footers, and different components (Determine 3). Some overflowing of textual content throughout pages may occur attributable to translation utilizing totally different phrases or variety of phrases. That is particularly apparent when a doc consists of graphics that will not match on a web page due to added phrases. It’s subsequently essential to examine every web page to be sure that formatting flows as meant.

Translating Very Giant Paperwork
I attempted to emphasize SharePoint Translation by translating the Phrase doc for the present model of the Workplace 365 for IT Execs eBook to French (the place the title turns into “Workplace 365 pour les professionnels de l’informatique”). The interpretation took about ten minutes to course of the 31.7 MB file. Determine 4 exhibits the Phrase statistics for the English and French variations of the doc. You’ll be able to see that the depend of pages, phrases, and characters elevated within the French model.

The English model of the supply Phrase doc incorporates about 4 million characters. The sheer dimension of the translated file highlighted the need to examine the formatting of the transformed doc, notably the stream of paragraphs throughout pages. Small issues matter relating to formatting, and I couldn’t perceive why the translated model had random areas inserted into sentences at instances (Determine 5).

Extra importantly for technical content material just like the Workplace 365 for IT Execs eBook, SharePoint Translation messes with PowerShell and different code. It’s exhausting accountable the interpretation algorithm as a result of it primarily processes phrases and code consists of phrases, albeit some unusual phrases which might be typically organized in unusual methods. Take this instance of a PowerShell snippet the place Translation modified cmdlet and parameter names. One factor’s for positive: PowerShell will barf if given this code to run.
Get-UnifiedGroup -ResultSize Illimité | Objet de tri DisplayName | Sélectionner un alias d'objet, DisplayName, ManagedBy, AccessType, GroupMemberCount, GroupExternalMemberCount, WhenCreated | export-csv -chemin c :tempTenantGroups.CSV -noTypeInformation -encodage ascii Invoke-Merchandise C:tempTenantGroups.CSV
One other factor afflicting the translated output for code examples is when languages use totally different citation marks. Take this instance, which received’t work both:
Get-UnifiedGroup -Identification « [email protected] »
Audit Information
The SharePoint Translation service doesn’t create audit occasions when recordsdata are submitted for translation nor when the processing of the recordsdata succeeds or fails. Nevertheless, SharePoint On-line logs occasions when translated recordsdata are uploaded. Sometimes, three occasions seem:
- FileUploaded: The app@sharepoint account uploads a translated file. The file has a short lived file title.
- FileRenamed: The non permanent file title is changed with its everlasting model.
- FileModified: A number of information are captured when SharePoint updates the properties of the non permanent file to match its supply.
Listed below are some particulars extracted from the unified audit log to indicate the contents of the three audit information captured for a file translation:
UserPrincipalName : app@sharepoint Timestamp : 21-Mar-2024 12:23:41 fileName : ~tmp1B_Office 365 for IT Execs 10 - April 2024_fr.docx Operation : FileUploaded ObjectId : https://office365itpros.sharepoint.com/websites/O365ExchPro/Shared Paperwork/E book Information/2024 Version E book Information/~tmp1B_Office 365 for IT Execs 10 - April 2024_fr.docx UserPrincipalName : app@sharepoint Timestamp : 21-Mar-2024 12:23:51 fileName : ~tmp1B_Office 365 for IT Execs 10 - April 2024_fr.docx Operation : FileRenamed ObjectId : https://office365itpros.sharepoint.com/websites/O365ExchPro/Shared Paperwork/E book Information/2024 Version E book Information/~tmp1B_Office 365 for IT Execs 10 - April 2024_fr.docx Software : Media Evaluation and Transformation Service UserPrincipalName : app@sharepoint Timestamp : 21-Mar-2024 12:23:57 fileName : Workplace 365 for IT Execs 10 - April 2024_fr.docx Operation : FileModified ObjectId : https://office365itpros.sharepoint.com/websites/O365ExchPro/Shared Paperwork/E book Information/2024 Version E book Information/Workplace 365 for IT Execs 10 - April 2024_fr.docx
The audit information don’t inform us who submitted paperwork for translation. They solely inform concerning the profitable consequence of translation makes an attempt.
PowerPoint and PDFs
Given the success of translating massive Phrase paperwork, I made a decision to check with a PDF generated by Phrase from these recordsdata. The enter was a 32.1 MB file with out a sensitivity label. It doesn’t matter what I did, all makes an attempt to translate the PDF failed. Makes an attempt with smaller PDFs succeeded, so I ponder if a special dimension restrict applies to PDFs than to Workplace recordsdata.
I additionally tried translating PowerPoint shows. Though profitable, cautious evaluation of the output slides is critical as a result of the formatting drawback reappeared when textual content overflowed and interfered with the position of graphics.
Translation Prices
Over a day or so of testing translation towards Phrase paperwork, PowerPoint shows, and PDFs, I accrued about $82 (EUR75.19) of costs for my Azure subscription. Azure accrued the fees towards a useful resource known as microsoft.syntex/documentprocessorsresource. This is similar useful resource title utilized by Microsoft 365 Backup. To get extra perception into the fees for the interpretation service, I needed to seek the advice of the bill particulars view (Determine 6).

On the listing value of $15 per million characters, I assume I should have translated about 6.5 million characters (a million free, the rest paid). On condition that Microsoft 365 Backup may very well be described as a mission-critical utility, it’s curious that translation prices a lot greater than the monthy cost for Microsoft 365 Backup in my tenant.
No additional particulars for translation costs can be found from the Azure portal. I do not know how a lot it prices to translate particular person paperwork or if Microsoft costs for failed translation makes an attempt.
Among the paperwork I translated have been massive and I anticipated to see some costs, however not the billed bill. Clearly, I examined enthusiastically. Joking apart, this expertise factors to the necessity to limit the interpretation choice to particular websites the place translation is a enterprise want fairly than a pleasant to have.
Translation Good for Normal Workplace Paperwork
I believe that the goal for SharePoint Translation is unlikely to be paperwork that comprise code examples. Extra doubtless, Microsoft is aiming for a extra “normal” type of Workplace paperwork. In any case, the present provide is a superb alternative for tenants who function in multinational environments to check SharePoint Translation freed from cost. By submitting quite a lot of paperwork for translation, you’ll be capable to establish if and the place points exist.
Informing customers concerning the progress (or lack) of translations is an space the place I believe Microsoft may enhance. They may additionally do higher at figuring out the character and element of costs accrued towards the Azure subscription in order that organizations could make inside chargebacks or just monitor consumption higher.
Total, the underside line is that if it’s essential translate paperwork, SharePoint Translation is a good way to generate a place to begin for human translators to finish the duty.
+ There are no comments
Add yours