{"id":1852,"date":"2017-06-22T22:31:37","date_gmt":"2017-06-22T22:31:37","guid":{"rendered":"http:\/\/elbsolutions.com\/projects\/?p=1852"},"modified":"2022-02-03T11:24:29","modified_gmt":"2022-02-03T17:24:29","slug":"ripping-sharepoint-apart","status":"publish","type":"post","link":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/","title":{"rendered":"Ripping Sharepoint Metadata Apart"},"content":{"rendered":"<p>I <a href=\"http:\/\/elbsolutions.com\/projects\/extracting-sharepoint-library-data-and-attached-files\/\">previously blogged about exporting files and metadata from Sharepoint.<\/a> It worked!!!! Well, now that all the files are extracted &#8211; I have ended up with a csv.<\/p>\n<ol>\n<li>seems some double quotes are included &#8211; but a proper csv parser (eg. Excel) will turn those into 1 single quote<\/li>\n<li>The Xml column contains an xml node<\/li>\n<li>Add an xml header and a root node\n<ol>\n<li>&lt;?xml version=&#8221;1.0&#8243; encoding=&#8221;UTF-8&#8243; standalone=&#8221;no&#8221; ?&gt;<\/li>\n<li>&lt;root&gt;<\/li>\n<li>&lt;z:row &#8230;..<br \/>\n\/&gt;<\/li>\n<li>&lt;\/root&gt;<\/li>\n<\/ol>\n<\/li>\n<li>Then using Notepad++ with &#8220;XML Tools plugin installed&#8221; &#8211; you can surf the node path with ctrl-alt-shift-P\n<ol>\n<li>turns out to be \/root\/z:row<\/li>\n<li>a c# app &#8211;<a href=\"https:\/\/stackoverflow.com\/questions\/11295662\/sharepoint-xml-parse-c-sharp\"> try this article<\/a><\/li>\n<\/ol>\n<\/li>\n<li>Also &#8211; use the Plugins-&gt;xml tools-&gt;Pretty Print Attributes<\/li>\n<li>An XPath \/root\/z:row[@ows_MetaInfo] would get what we require\n<ol>\n<li>Parse this crazy thing with CRLF or<\/li>\n<li>The first number 1234;# is the id number &#8211; remove it<\/li>\n<li>field:TY|val\n<ol>\n<li>field name<\/li>\n<li>TY = type<\/li>\n<li>val = value<\/li>\n<\/ol>\n<\/li>\n<li><a href=\"https:\/\/stackoverflow.com\/questions\/10402891\/remove-all-but-a-specific-portion-of-a-string-in-javascript\">Use a regex like this to parse<\/a><\/li>\n<\/ol>\n<\/li>\n<li>Now .. undoing the whole thing by hand\n<ol>\n<li>undo the HTML Entities &#8211; now this will likely be done with the XML API\n<ol>\n<li>use the Notpad++ plugin HTML Tag (Plugin-&gt;Html Tag-&gt;Decode Enitites)<\/li>\n<\/ol>\n<\/li>\n<li>&amp;lt; -&gt; &lt; etc.<\/li>\n<li>unencode entities like #x0020 to a space etc.<\/li>\n<\/ol>\n<\/li>\n<li>To extract thumbnails &#8211; they are stored in Base64 &#8211; <a href=\"http:\/\/base64decode.net\/c-sharp-system-convert-frombase64string\">here is a c# app to decode<\/a> into jpegs<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I previously blogged about exporting files and metadata from Sharepoint. It worked!!!! Well, now that all the files are extracted &#8211; I have ended up with a csv. seems some double quotes are included &#8211; but a proper csv parser (eg. Excel) will turn those into 1 single quote The Xml column contains an xml [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-1852","post","type-post","status-publish","format-standard","hentry","category-general"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.7 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Ripping Sharepoint Metadata Apart - ELB Solutions.com Inc.<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Ripping Sharepoint Metadata Apart - ELB Solutions.com Inc.\" \/>\n<meta property=\"og:description\" content=\"I previously blogged about exporting files and metadata from Sharepoint. It worked!!!! Well, now that all the files are extracted &#8211; I have ended up with a csv. seems some double quotes are included &#8211; but a proper csv parser (eg. Excel) will turn those into 1 single quote The Xml column contains an xml [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/\" \/>\n<meta property=\"og:site_name\" content=\"ELB Solutions.com Inc.\" \/>\n<meta property=\"article:published_time\" content=\"2017-06-22T22:31:37+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2022-02-03T17:24:29+00:00\" \/>\n<meta name=\"author\" content=\"Etienne Bley\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Etienne Bley\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/\"},\"author\":{\"name\":\"Etienne Bley\",\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/#\\\/schema\\\/person\\\/51e717c68f4f5917c63baf88f0896c39\"},\"headline\":\"Ripping Sharepoint Metadata Apart\",\"datePublished\":\"2017-06-22T22:31:37+00:00\",\"dateModified\":\"2022-02-03T17:24:29+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/\"},\"wordCount\":242,\"articleSection\":[\"General\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/\",\"url\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/\",\"name\":\"Ripping Sharepoint Metadata Apart - ELB Solutions.com Inc.\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/#website\"},\"datePublished\":\"2017-06-22T22:31:37+00:00\",\"dateModified\":\"2022-02-03T17:24:29+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/#\\\/schema\\\/person\\\/51e717c68f4f5917c63baf88f0896c39\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/ripping-sharepoint-apart\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Ripping Sharepoint Metadata Apart\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/#website\",\"url\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/\",\"name\":\"ELB Solutions.com Inc.\",\"description\":\"Bringing all your IT Pieces together\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/#\\\/schema\\\/person\\\/51e717c68f4f5917c63baf88f0896c39\",\"name\":\"Etienne Bley\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/f8971dfb65b25b768415568f83247df4057f15d037137e386928a804e2c997b9?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/f8971dfb65b25b768415568f83247df4057f15d037137e386928a804e2c997b9?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/f8971dfb65b25b768415568f83247df4057f15d037137e386928a804e2c997b9?s=96&d=mm&r=g\",\"caption\":\"Etienne Bley\"},\"url\":\"https:\\\/\\\/elbsolutions.com\\\/projects\\\/author\\\/etienne-bley\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Ripping Sharepoint Metadata Apart - ELB Solutions.com Inc.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/","og_locale":"en_US","og_type":"article","og_title":"Ripping Sharepoint Metadata Apart - ELB Solutions.com Inc.","og_description":"I previously blogged about exporting files and metadata from Sharepoint. It worked!!!! Well, now that all the files are extracted &#8211; I have ended up with a csv. seems some double quotes are included &#8211; but a proper csv parser (eg. Excel) will turn those into 1 single quote The Xml column contains an xml [&hellip;]","og_url":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/","og_site_name":"ELB Solutions.com Inc.","article_published_time":"2017-06-22T22:31:37+00:00","article_modified_time":"2022-02-03T17:24:29+00:00","author":"Etienne Bley","twitter_misc":{"Written by":"Etienne Bley","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/#article","isPartOf":{"@id":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/"},"author":{"name":"Etienne Bley","@id":"https:\/\/elbsolutions.com\/projects\/#\/schema\/person\/51e717c68f4f5917c63baf88f0896c39"},"headline":"Ripping Sharepoint Metadata Apart","datePublished":"2017-06-22T22:31:37+00:00","dateModified":"2022-02-03T17:24:29+00:00","mainEntityOfPage":{"@id":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/"},"wordCount":242,"articleSection":["General"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/","url":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/","name":"Ripping Sharepoint Metadata Apart - ELB Solutions.com Inc.","isPartOf":{"@id":"https:\/\/elbsolutions.com\/projects\/#website"},"datePublished":"2017-06-22T22:31:37+00:00","dateModified":"2022-02-03T17:24:29+00:00","author":{"@id":"https:\/\/elbsolutions.com\/projects\/#\/schema\/person\/51e717c68f4f5917c63baf88f0896c39"},"breadcrumb":{"@id":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/elbsolutions.com\/projects\/ripping-sharepoint-apart\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/elbsolutions.com\/projects\/"},{"@type":"ListItem","position":2,"name":"Ripping Sharepoint Metadata Apart"}]},{"@type":"WebSite","@id":"https:\/\/elbsolutions.com\/projects\/#website","url":"https:\/\/elbsolutions.com\/projects\/","name":"ELB Solutions.com Inc.","description":"Bringing all your IT Pieces together","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/elbsolutions.com\/projects\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/elbsolutions.com\/projects\/#\/schema\/person\/51e717c68f4f5917c63baf88f0896c39","name":"Etienne Bley","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/f8971dfb65b25b768415568f83247df4057f15d037137e386928a804e2c997b9?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/f8971dfb65b25b768415568f83247df4057f15d037137e386928a804e2c997b9?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/f8971dfb65b25b768415568f83247df4057f15d037137e386928a804e2c997b9?s=96&d=mm&r=g","caption":"Etienne Bley"},"url":"https:\/\/elbsolutions.com\/projects\/author\/etienne-bley\/"}]}},"_links":{"self":[{"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/posts\/1852","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/comments?post=1852"}],"version-history":[{"count":8,"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/posts\/1852\/revisions"}],"predecessor-version":[{"id":1860,"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/posts\/1852\/revisions\/1860"}],"wp:attachment":[{"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/media?parent=1852"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/categories?post=1852"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/elbsolutions.com\/projects\/wp-json\/wp\/v2\/tags?post=1852"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}