spark.zip
大小:21MB
价格:10积分
下载量:0
评分:
5.0
上传者:2301_79070867
更新日期:2025-09-22

spark课程设计任务

资源文件列表(大概)

文件名
大小
spark/data.csv
65.61MB
spark/FoodSpark2/
-
spark/FoodSpark2/.idea/
-
spark/FoodSpark2/.idea/artifacts/
-
spark/FoodSpark2/.idea/artifacts/FoodSpark2_jar.xml
301B
spark/FoodSpark2/.idea/codeStyles/
-
spark/FoodSpark2/.idea/codeStyles/codeStyleConfig.xml
153B
spark/FoodSpark2/.idea/compiler.xml
543B
spark/FoodSpark2/.idea/misc.xml
541B
spark/FoodSpark2/.idea/scala_compiler.xml
191B
spark/FoodSpark2/.idea/workspace.xml
6.23KB
spark/FoodSpark2/out/
-
spark/FoodSpark2/out/artifacts/
-
spark/FoodSpark2/out/artifacts/FoodSpark2_jar/
-
spark/FoodSpark2/out/artifacts/FoodSpark2_jar/FoodSpark2.jar
27.79KB
spark/FoodSpark2/pom.xml
2.7KB
spark/FoodSpark2/src/
-
spark/FoodSpark2/src/main/
-
spark/FoodSpark2/src/main/java/
-
spark/FoodSpark2/src/main/resources/
-
spark/FoodSpark2/src/main/scala/
-
spark/FoodSpark2/src/main/scala/FoodSpark/
-
spark/FoodSpark2/src/main/scala/FoodSpark/FoodSpark.scala
6.63KB
spark/FoodSpark2/src/main/scala/FoodSpark/FoodSpark2.scala
6.62KB
spark/FoodSpark2/src/test/
-
spark/FoodSpark2/src/test/java/
-
spark/FoodSpark2/src/test/scala/
-
spark/FoodSpark2/target/
-
spark/FoodSpark2/target/classes/
-
spark/FoodSpark2/target/classes/FoodSpark/
-
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$1.class
1.12KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$2.class
1000B
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$3$$anonfun$apply$1.class
1.17KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$3.class
1.64KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$4.class
1.2KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$5.class
1.21KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$6.class
1.33KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$7.class
1.51KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$8.class
1.79KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$$anonfun$main$1.class
1.3KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark$.class
4.1KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark.class
1KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$1.class
1.13KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$2.class
1004B
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$3$$anonfun$apply$1.class
1.18KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$3.class
1.65KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$4.class
1.2KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$5.class
1.22KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$6.class
1.33KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$7.class
1.51KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$8.class
1.79KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$9.class
1.22KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$main$1.class
1.06KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$main$2.class
1.3KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$$anonfun$main$3.class
1.06KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2$.class
4.36KB
spark/FoodSpark2/target/classes/FoodSpark/FoodSpark2.class
1.17KB
spark/FoodSpark2/target/classes/META-INF/
-
spark/FoodSpark2/target/classes/META-INF/FoodSpark2.kotlin_module
16B
spark/FoodSpark2/target/generated-sources/
-
spark/FoodSpark2/target/generated-sources/annotations/
-

资源内容介绍

对数据文件data.csv在根据课程设计要求在IDEA中编写Spark程序并打包成jar包,并且将数据文件data.csv上传到HDFS中,提交jar包到Spark集群中运行。data.csv里面包含餐厅数据,可以用记事本打开查看,主要包含以下13个字段,字段之间由逗号分隔:"所属年月","商家名称","主营类型","店铺URL","特色菜","累计评论数","累计销售人次","店铺评分","本月销量","本月销售额","城市","商家地址","电话"课程设计要求完成以下任务,在IDEA中创建Maven项目,提供pom.xml文件给大家,完成以下任务。一、将data.csv文件上传到hdfs的/cateringdata/目录下。二、编写Spark程序,实现以下功能,并且将程序打包为jar包1.去掉"本月销量","本月销售额"(第8列和第9列)的数据异常(数据为空字符串或者null或者为0),并且统计去掉了多少条;这一步清洗获得的数据要以逗号分割,存到hdfs的/foodsparktask1目录下;2.去掉"店铺评分"数据为null的数据,并统计去掉了多少条;3.去掉"店
<?xml version="1.0" encoding="UTF-8"?><project version="4"> <component name="ArtifactsWorkspaceSettings"> <artifacts-to-build> <artifact name="FoodSpark2:jar" /> </artifacts-to-build> </component> <component name="AutoImportSettings"> <option name="autoReloadType" value="SELECTIVE" /> </component> <component name="ChangeListManager"> <list default="true" id="2f6761c6-62fb-48be-959b-884768030495" name="Default Changelist" comment="" /> <option name="SHOW_DIALOG" value="false" /> <option name="HIGHLIGHT_CONFLICTS" value="true" /> <option name="HIGHLIGHT_NON_ACTIVE_CHANGELIST" value="false" /> <option name="LAST_RESOLUTION" value="IGNORE" /> </component> <component name="CodeStyleSettingsInfer"> <option name="done" value="true" /> </component> <component name="MarkdownSettingsMigration"> <option name="stateVersion" value="1" /> </component> <component name="ProjectCodeStyleSettingsMigration"> <option name="version" value="2" /> </component> <component name="ProjectId" id="2hrnt4S2vdqV91WTfQqSRlO5Edh" /> <component name="ProjectViewState"> <option name="hideEmptyMiddlePackages" value="true" /> <option name="showLibraryContents" value="true" /> </component> <component name="PropertiesComponent"> <property name="last_opened_file_path" value="$PROJECT_DIR$" /> <property name="project.structure.last.edited" value="Artifacts" /> <property name="project.structure.proportion" value="0.15" /> <property name="project.structure.side.proportion" value="0.2" /> <property name="settings.editor.selected.configurable" value="preferences.lookFeel" /> </component> <component name="RecentsManager"> <key name="CopyClassDialog.RECENTS_KEY"> <recent name="FoodSpark" /> </key> </component> <component name="RunManager" selected="Application.FoodSpark"> <configuration name="FoodSpark" type="Application" factoryName="Application" temporary="true"> <option name="MAIN_CLASS_NAME" value="FoodSpark.FoodSpark" /> <module name="FoodSpark2" /> <extension name="coverage"> <pattern> <option name="PATTERN" value="FoodSpark.*" /> <option name="ENABLED" value="true" /> </pattern> </extension> <method v="2"> <option name="Make" enabled="true" /> </method> </configuration> <configuration name="Scala REPL" type="ScalaScriptConsoleRunConfiguration" factoryName="Scala Console" temporary="true" show_console_on_std_err="false" show_console_on_std_out="false"> <module name="FoodSpark2" /> <option name="allowRunningInParallel" value="false" /> <option name="javaOptions" value="-Djline.terminal=NONE" /> <option name="myConsoleArgs" value="" /> <option name="workingDirectory" value="$PROJECT_DIR$" /> <method v="2"> <option name="Make" enabled="true" /> </method> </configuration> <recent_temporary> <list> <item itemvalue="Application.FoodSpark" /> <item itemvalue="Scala REPL.Scala REPL" /> </list> </recent_temporary> </component> <component name="SpellCheckerSettings" BundledDictionaries="0" RuntimeDictionaries="0" Folders="0" CustomDictionaries="0" Dictionaries="0" CorrectionsLimit="5" DefaultDictionary="应用程序级" UseSingleDictionary="true" /> <component name="SvnConfiguration"> <configuration /> </component> <component name="TaskManager"> <task active="true" id="Default" summary="Default task"> <changelist id="2f6761c6-62fb-48be-959b-884768030495" name="Default Changelist" comment="" /> <created>1718361274808</created> <option name="number" value="Default" /> <option name="presentableId" value="Default" /> <updated>1718361274808</updated> </task> <servers /> </component> <component name="WindowStateProjectService"> <state x="634" y="250" key="FileChooserDialogImpl" timestamp="1718531781760"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state x="634" y="250" key="FileChooserDialogImpl/0.0.1707.1019@0.0.1707.1019" timestamp="1718531781760" /> <state width="1663" height="274" key="GridCell.Tab.0.bottom" timestamp="1718678758041"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state width="1663" height="274" key="GridCell.Tab.0.bottom/0.0.1707.1019@0.0.1707.1019" timestamp="1718678758041" /> <state width="1663" height="274" key="GridCell.Tab.0.center" timestamp="1718678758041"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state width="1663" height="274" key="GridCell.Tab.0.center/0.0.1707.1019@0.0.1707.1019" timestamp="1718678758041" /> <state width="1663" height="274" key="GridCell.Tab.0.left" timestamp="1718678758040"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state width="1663" height="274" key="GridCell.Tab.0.left/0.0.1707.1019@0.0.1707.1019" timestamp="1718678758040" /> <state width="1663" height="274" key="GridCell.Tab.0.right" timestamp="1718678758041"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state width="1663" height="274" key="GridCell.Tab.0.right/0.0.1707.1019@0.0.1707.1019" timestamp="1718678758041" /> <state x="334" y="84" key="SettingsEditor" timestamp="1718678034338"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state x="334" y="84" key="SettingsEditor/0.0.1707.1019@0.0.1707.1019" timestamp="1718678034338" /> <state x="93" y="93" width="1521" height="833" key="com.intellij.history.integration.ui.views.DirectoryHistoryDialog" timestamp="1718531318720"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state x="93" y="93" width="1521" height="833" key="com.intellij.history.integration.ui.views.DirectoryHistoryDialog/0.0.1707.1019@0.0.1707.1019" timestamp="1718531318720" /> <state x="538" y="334" key="com.intellij.ide.util.TipDialog" timestamp="1718716314753"> <screen x="0" y="0" width="1707" height="1019" /> </state> <state x="538" y="334" key="com.intellij.ide.util.TipDialog/0.0.1707.1019@0.0.1707.1019" timestamp="1718716314753" /> </component></project>

用户评论 (0)

发表评论

captcha

相关资源

windows11编译安装pysqlcipher3

windows11编译安装pysqlcipher3

152.5KB36积分

数据可视化的可视化大屏课程设计

1、选取合适的数据集:根据毕业设计的主题和目标,选择与之相关的合适的数据集。确保数据集可获得并包含足够的样本和特征,以支持后续的数据分析和开发工作。2、项目背景:介绍毕业设计的背景和动机,说明为什么选择该课题以及相关领域的研究现状和问题。3、项目目标:明确毕业设计的目标和预期成果,阐述希望通过这个项目实现的具体价值和意义。4、数据说明:详细描述所选数据集的来源、规模、结构和内容,包括数据的格式、字段含义、数据质量等信息。5、项目分析:对数据集进行初步的数据分析,包括数据预处理、特征提取、数据探索等,以获得对数据集的深入理解,为后续的开发工作做准备。6、开发环境:采用Echarts大数据可视化技术。项目开发采用当前最主流的前后端分离的方式:后端用Pycharm工具搭建Flask框架,然后利用Python技术完成数据清洗、数据制作,最终形成数据接口;前端用Vscode工具完成可视化大屏布局、用Echarts技术完成图形展示;前后端只通过数据接口交互。7、后端开发:根据项目需求和目标,进行后端开发工作,包括数据处理、函数的设计等,确保后端能够正确处理数据,并提供相应的接口和功能。

10.08MB35积分

windows11编译安装pysqlcipher3

windows11编译安装pysqlcipher3

2.64MB43积分

爬取豆瓣电影top250和爬取当当网数据

(2)编写爬虫程序,使用Urllib或Requests库获取到服务器URL的首页数据。(3)解析数据,包含图书编号、名称、作者、出版社、出版时间、价格、简介、图书图片的URL,同时实现翻页功能爬取全部网页数据;(4)数据持久化存储:将全部解析的数据存储到 .CSV文件;将全部图书的图片存储到当前目录中“download”文件夹;将全部解析的数据存储到数据库( MySQL或MongoDB )。 编写爬虫程序,使用获取到服务器URL的首页数据。(3)使用解析RE、BS4、XPATH数据,包含图书编号、名称、作者、出版社、出版时间、价格、简介、图书图片的URL,同时实现翻页功能爬取全部网页数据;(4)数据持久化存储:将全部解析的数据存储到 .CSV文件;将全部图书的图片存储到当前目录中“download”文件夹;将全部解析的数据存储到数据库( MySQL或MongoDB )。

1.95KB29积分