I'm a college student working on a semester-long project, and I've hit a roadblock. Before I go on, please know that I've looked at similar threads on Stack Overflow, and none of them seem to match my situation.

I have a string input generated from a PDF that contains a lot of data from a table. The problem is that, because of the formatting, some entries in the department column span two lines instead of one, and I can't work around it. For example:

    PS 253 (handled fine by my algorithm)
    MA\n243HON (breaks everything)

I need to end up with these on the same line, removing the "\n" after MA, so that I can pass the result to the rest of the program. I tried checking for \n one or two index positions after the department code (MA) and changing the index from which I get 243HON, but that didn't work. I also tried string = string.replaceAll("MA \n", "MA "), as shown in the code. Removing the space between MA and \n didn't do anything either.

Here is the relevant part of my code. Thanks!

    public static String[] departments = {
        "\nAS", "\nSF", "\nAE", "\nAF", "\nAT", "\nLAR", "\nAMS", "\nBIO", "\nBA", "\nCHM", "\nLCH",
        "\nCIV", "\nCSO", "\nCOM", "\nCEC", "\nCS", "\nCYB", "\nEC", "\nEE", "\nEGR", "\nEP", "\nES",
        "\nFA", "\nGCS", "\nHS", "\nHON", "\nHF", "\nHU", "\nMA", "\nME", "\nWX", "\nMSL", "\nNSC",
        "\nPE", "\nPS", "\nPSY", "\nSIM", "\nSS", "\nSE", "\nSP", "\nSYS", "\nUNIV", "\nUA"};

    public static String[] departmentsFix = {
        "\nAS \n", "\nSF \n", "\nAE \n", "\nAF \n", "\nAT \n", "\nLAR \n", "\nAMS \n", "\nBIO \n",
        "\nBA \n", "\nCHM \n", "\nLCH \n", "\nCIV \n", "\nCSO \n", "\nCOM \n", "\nCEC \n", "\nCS \n",
        "\nCYB \n", "\nEC \n", "\nEE \n", "\nEGR \n", "\nEP \n", "\nES \n", "\nFA \n", "\nGCS \n",
        "\nHS \n", "\nHON \n", "\nHF \n", "\nHU \n", "\nMA \n", "\nME \n", "\nWX \n", "\nMSL \n",
        "\nNSC \n", "\nPE \n", "\nPS \n", "\nPSY \n", "\nSIM \n", "\nSS \n", "\nSE \n", "\nSP \n",
        "\nSYS \n", "\nUNIV \n", "\nUA \n"};

    public static void main(String[] args) {
        // TODO Auto-generated method stub
        Loader loader = new Loader();
        try {
            File file = new File("C:\\Users\\User\\Desktop\\EclipseWorkspace\\SE 300\\ER_SCHED_PRT.pdf");
            PDDocument document = PDDocument.load(file);
            PDFTextStripper s = new PDFTextStripper();
            loader.content = s.getText(document);
            // Keep only the text that follows the first "Instructor" header
            String[] splitString = loader.content.split("Instructor", 2);
            loader.content = splitString[1];
            int index = 0;
            for (String y : departmentsFix) {
                // find any departments with a \n after them and replace it with a space
                loader.content = loader.content.replaceAll(y, departments[index] + " ");
                index++;
            }
1 Answer

白衣染霜花
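I just fixed it. Using the find function, I discovered that the format was not \nMA\n but \nMA\r\n. Changing that fixed the problem for the most part, apart from a minor, insignificant bug that can be compensated for by using extra spaces. Thanks for your help anyway.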
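For anyone hitting the same thing, here is a minimal sketch of one way to apply that finding. It is not the code from my project: the class name DepartmentLineJoiner, the helpers showEscapes and joinSplitEntries, and the shortened CODES list are all made up for illustration. It assumes the extracted text may use Windows line endings (\r\n), normalizes them to \n, and then joins any line that holds only a department code with the line after it.

```java
class DepartmentLineJoiner {
    // Hypothetical subset of the department codes; the real list is longer.
    static final String[] CODES = {"MA", "PS", "HON", "CS", "CSO"};

    // Makes invisible line-ending characters visible, which is how a stray \r can be spotted.
    static String showEscapes(String text) {
        return text.replace("\r", "\\r").replace("\n", "\\n");
    }

    // Joins "<code><optional blanks><line break>" at the start of a line with the next line.
    static String joinSplitEntries(String content) {
        // Normalize \r\n and lone \r to \n so only one kind of line break remains.
        String normalized = content.replace("\r\n", "\n").replace('\r', '\n');
        String alternation = String.join("|", CODES);
        // (?m) lets ^ match at the start of every line; [ \t]* absorbs trailing blanks.
        String regex = "(?m)^(" + alternation + ")[ \\t]*\\n";
        // $1 is the department code that was matched; the line break becomes one space.
        return normalized.replaceAll(regex, "$1 ");
    }
}
```

With this sketch, showEscapes("MA\r\n243HON") gives the visible text MA\r\n243HON, and joinSplitEntries("PS 253\r\nMA\r\n243HON\r\n") returns "PS 253\nMA 243HON\n". Anchoring the pattern at the start of a line keeps the HON in 243HON from being treated as a department code.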