HTML Unicode (UTF-8) Reference - W3Schools

文章推薦指數: 80 %
投票人數:10人

Unicode is a character set. UTF-8 is encoding. Unicode is a list of characters with unique decimal numbers (code points). A = 65, B = 66, ... Tutorials References Exercises Videos Menu Login FreeWebsite GetCertified Pro HTML CSS JAVASCRIPT SQL PYTHON JAVA PHP BOOTSTRAP HOWTO W3.CSS C C++ C# REACT R JQUERY DJANGO TYPESCRIPT NODEJS MYSQL    Darkmode Darkcode × Tutorials HTMLandCSS LearnHTML LearnCSS LearnRWD LearnBootstrap LearnW3.CSS LearnColors LearnIcons LearnGraphics LearnSVG LearnCanvas LearnHowTo LearnSass DataAnalytics LearnAI LearnMachineLearning LearnDataScience LearnNumPy LearnPandas LearnSciPy LearnMatplotlib LearnStatistics LearnExcel XMLTutorials LearnXML LearnXMLAJAX LearnXMLDOM LearnXMLDTD LearnXMLSchema LearnXSLT LearnXPath LearnXQuery JavaScript LearnJavaScript LearnjQuery LearnReact LearnAngularJS LearnJSON LearnAJAX LearnAppML LearnW3.JS Programming LearnPython LearnJava LearnC LearnC++ LearnC# LearnR LearnKotlin LearnGo LearnDjango LearnTypeScript ServerSide LearnSQL LearnMySQL LearnPHP LearnASP LearnNode.js LearnRaspberryPi LearnGit LearnMongoDB LearnAWSCloud WebBuilding CreateaWebsiteNEW WhereToStart WebTemplates WebStatistics WebCertificates WebDevelopment CodeEditor TestYourTypingSpeed PlayaCodeGame CyberSecurity Accessibility DataAnalytics LearnAI LearnMachineLearning LearnDataScience LearnNumPy LearnPandas LearnSciPy LearnMatplotlib LearnStatistics LearnExcel LearnGoogleSheets XMLTutorials LearnXML LearnXMLAJAX LearnXMLDOM LearnXMLDTD LearnXMLSchema LearnXSLT LearnXPath LearnXQuery × References HTML HTMLTagReference HTMLBrowserSupport HTMLEventReference HTMLColorReference HTMLAttributeReference HTMLCanvasReference HTMLSVGReference GoogleMapsReference CSS CSSReference CSSBrowserSupport CSSSelectorReference Bootstrap3Reference Bootstrap4Reference W3.CSSReference IconReference SassReference JavaScript JavaScriptReference HTMLDOMReference jQueryReference AngularJSReference AppMLReference W3.JSReference Programming PythonReference JavaReference ServerSide SQLReference MySQLReference PHPReference ASPReference XML XMLDOMReference XMLHttpReference XSLTReference XMLSchemaReference CharacterSets HTMLCharacterSets HTMLASCII HTMLANSI HTMLWindows-1252 HTMLISO-8859-1 HTMLSymbols HTMLUTF-8 × ExercisesandQuizzes Exercises HTMLExercises CSSExercises JavaScriptExercises PythonExercises SQLExercises PHPExercises JavaExercises CExercises C++Exercises C#Exercises jQueryExercises React.jsExercises MySQLExercises Bootstrap5Exercises Bootstrap4Exercises Bootstrap3Exercises NumPyExercises PandasExercises SciPyExercises TypeScriptExercises ExcelExercises RExercises GitExercises KotlinExercises GoExercises MongoDBExercises Quizzes HTMLQuiz CSSQuiz JavaScriptQuiz PythonQuiz SQLQuiz PHPQuiz JavaQuiz CQuiz C++Quiz C#Quiz jQueryQuiz React.jsQuiz MySQLQuiz Bootstrap5Quiz Bootstrap4Quiz Bootstrap3Quiz NumPyQuiz PandasQuiz SciPyQuiz TypeScriptQuiz XMLQuiz RQuiz GitQuiz KotlinQuiz CyberSecurityQuiz AccessibilityQuiz Courses HTMLCourse CSSCourse JavaScriptCourse FrontEndCourse PythonCourse SQLCourse PHPCourse JavaCourse C++Course C#Course jQueryCourse React.jsCourse Bootstrap4Course Bootstrap3Course NumPyCourse PandasCourse TypeScriptCourse XMLCourse RCourse DataAnalyticsCourse CyberSecurityCourse AccessibilityCourse Certificates HTMLCertificate CSSCertificate JavaScriptCertificate FrontEndCertificate PythonCertificate SQLCertificate PHPCertificate JavaCertificate C++Certificate C#Certificate jQueryCertificate React.jsCertificate MySQLCertificate Bootstrap5Certificate Bootstrap4Certificate Bootstrap3Certificate TypeScriptCertificate XMLCertificate ExcelCertificate DataScienceCertificate CyberSecurityCertificate AccessibilityCertificate × Tutorials References Exercises GetCertified Spaces Videos Shop Pro HTMLCharsets HTMLCharsets HTMLASCII HTMLWIN-1252 HTMLISO-8859 HTMLSymbols HTMLUTF-8 HTMLUTF-8 LatinBasic LatinSupplement LatinExtendedA LatinExtendedB ModifierLetters DiacriticalMarks GreekandCoptic CyrillicBasic CyrillicSupplement HTMLSymbols GeneralPunctuation CurrencySymbols LetterlikeSymbols Arrows MathOperators BoxDrawings BlockElements GeometricShapes MiscSymbols Dingbats Emoji EmojiSmileys EmojiSkinTones HTMLEntities HTML4Entities HTML5EntitiesA HTML5EntitiesB HTML5EntitiesC HTML5EntitiesD HTML5EntitiesE HTML5EntitiesF HTML5EntitiesG HTML5EntitiesH HTML5EntitiesI HTML5EntitiesJ HTML5EntitiesK HTML5EntitiesL HTML5EntitiesM HTML5EntitiesN HTML5EntitiesO HTML5EntitiesP HTML5EntitiesQ HTML5EntitiesR HTML5EntitiesS HTML5EntitiesT HTML5EntitiesU HTML5EntitiesV HTML5EntitiesW HTML5EntitiesX HTML5EntitiesY HTML5EntitiesZ HTMLUnicode(UTF-8)Reference ❮Previous Next❯ TheUnicodeConsortium TheUnicodeConsortiumdevelopstheUnicodeStandard. TheirgoalistoreplacetheexistingcharactersetswithitsstandardUnicode TransformationFormat(UTF). TheUnicodeStandardhasbecomeasuccessandisimplementedin HTML,XML,Java,JavaScript,E-mail,ASP,PHP,etc.TheUnicodestandardisalso supportedinmanyoperatingsystemsandallmodernbrowsers. TheUnicodeConsortiumcooperateswiththeleadingstandardsdevelopment organizations,likeISO,W3C,andECMA. TheUnicodeCharacterSets Unicodecanbeimplementedbydifferentcharactersets.Themostcommonlyused encodingsareUTF-8andUTF-16: Character-set Description UTF-8 AcharacterinUTF8canbefrom1to4byteslong.UTF-8canrepresentanycharacterintheUnicodestandard.UTF-8isbackwardscompatiblewithASCII.UTF-8isthepreferredencodingfore-mailandwebpages UTF-16 16-bitUnicodeTransformationFormatisavariable-lengthcharacterencodingforUnicode,capableofencodingtheentireUnicoderepertoire.UTF-16isusedinmajoroperatingsystemsandenvironments,likeMicrosoftWindows,Javaand.NET. Tip: Thefirst128charactersofUnicode(whichcorrespondone-to-onewithASCII)are encodedusingasingleoctetwiththesamebinaryvalueasASCII,makingvalid ASCIItextvalidUTF-8-encodedUnicodeaswell. HTML4supportsUTF-8.HTML5supports bothUTF-8andUTF-16! TheHTML5Standard:UnicodeUTF-8 BecausethecharactersetsinISO-8859were limitedinsize,andnotcompatibleinmultilingualenvironments,the UnicodeConsortiumdevelopedtheUnicodeStandard. TheUnicodeStandardcovers(almost)allthecharacters,punctuations,andsymbolsinthe world. Unicodeenablesprocessing,storage,and transportoftextindependentofplatformandlanguage. ThedefaultcharacterencodinginHTML-5isUTF-8. IfanHTML5webpageusesadifferentcharactersetthanUTF-8,itshouldbe specifiedinthetaglike: Example TheDifferenceBetweenUnicodeandUTF-8 Unicodeisacharacterset.UTF-8isencoding. Unicodeisalistofcharacterswithuniquedecimalnumbers(codepoints).A= 65,B =66,C=67,.... Thislistofdecimalnumbersrepresentthestring"hello":104101108108111 Encodingishowthesenumbersaretranslatedintobinarynumberstobestored inacomputer: UTF-8encodingwillstore"hello"likethis(binary):011010000110010101101100 01101100 01101111 Encodingtranslatesnumbersintobinary.Character setstranslatescharacterstonumbers. HTML5UTF-8CharacterCodes BelowisalistofsomeoftheUTF-8charactercodessupportedbyHTML5: Charactercodes Decimal Hexadecimal C0ControlsandBasicLatin 0-127 0000-007F C1ControlsandLatin-1Supplement 128-255 0080-00FF LatinExtended-A 256-383 0100-017F LatinExtended-B 384-591 0180-024F SpacingModifiers 688-767 02B0-02FF DiacriticalMarks 768-879 0300-036F GreekandCoptic 880-1023 0370-03FF CyrillicBasic 1024-1279 0400-04FF CyrillicSupplement 1280-1327 0500-052F GeneralPunctuation 8192-8303 2000-206F CurrencySymbols 8352-8399 20A0-20CF LetterlikeSymbols 8448-8527 2100-214F Arrows 8592-8703 2190-21FF MathematicalOperators 8704-8959 2200-22FF BoxDrawings 9472-9599 2500-257F BlockElements 9600-9631 2580-259F GeometricShapes 9632-9727 25A0-25FF MiscellaneousSymbols 9728-9983 2600-26FF Dingbats 9984-10175 2700-27BF ❮Previous Next❯ NEW WejustlaunchedW3Schoolsvideos Explorenow COLORPICKER Getcertifiedbycompletingacoursetoday! w3schoolsCERTIFIED.2022 Getstarted CODEGAME PlayGame



請為這篇文章評分?